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BACKGROUND OF THE INVENTION 

Connective tissue growth factor (CTGF) is a 
growth factor expressed by endothelial and fibroblastic 
cells. Two members of the CTGF family are known, CTGF 
(U.S. Patent 5,585,270 and U.S. Patent 5,408,040) and 
CTFG-2 (WO 96/01896), incorporated herein by reference. 
CTGF belongs to a family of growth factors that includes 
CTGF, CTGF-2, insulin binding proteins (IBP) 1 and 2. 
These growth factors have a cysteine-rich motif and 
several other structural protein domains in common, and 
have been demonstrated to have a role in cell 
proliferation, differentiation, and chemotaxis. 

Studies have suggested that CTGF is 
immunologically similar to PDGF, with PDGF antibodies from 
both A and B chains binding to CTGF (Bradham et al . , J . of 
Cell Biol. 114 (6) :1285-1294, 1991), and it has been 
reported that biological activity of CTGF can be blocked 
using these antibodies. 

It has recently been demonstrated that porcine 
CTGF isolated from uterus is mitogenic for fibroblasts and 
smooth muscle cells but not endothelial cells. In 
addition, a N-terminally truncated form of the protein was 

active as well (Brigstock et al . , J, Biol. Chem. 

272 : 20275-20282 , 1997) . It has been suggested that this 
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protein could play a role in the growth and remodeling of 
the endometrium and placenta. 

The CTGF family is believed to play a role 
production of the extracellular matrix components, such as 
5 collagen and fibronectin. Collagen and fibronectin are 
components of many connective tissues, e.g., ligaments, 
cartilage, tendons and vessel walls. Current therapy for 
ligament repair is limited to immobilising the damaged 
tissue, i.e., stapling, or replacing the damaged tissue 

10 with synthetic or natural grafts. These tissues are 
notoriously difficult to heal, even in healthy 
individuals, and compositions that would improve recovery 
time would be very valuable. 

The present invention is directed to novel 

15 polypeptides and polynucleotides encoding the polypeptide 
that show predominantly high expression in testis, trachea 
and bone marrow, thus providing a new molecule for 
regulating growth, differentiation, chemotaxis and 
induction of specialized cell functions in these tissues. 

20 

SUMMARY OF THE INVENTION 

It is an object of the present invention to 

provide an isolated polynucleotide molecule comprising a 

polynucleotide sequence that encodes a connective tissue 
25 growth factor homolog polypeptide that is at least 70% 

identical to the amino acid sequence as shown in SEQ ID 

NO: 2 from residue 24 to residue 354. 

In another embodiment, the present invention 

provides for a polynucleotide molecule wherein the 
3 0 polynucleotide molecule comprises a region having the 

following motif as shown in SEQ ID NO: 23: 

Cx{8,10}CxCCxxCx{7}Cx{5,6}Cx{5,7}Cx{l2,13}Cx{7,8}Cx{20} 

CxCx{6} Cx{l2,14}Cx{l3,17}C 

wherein x{ } is the number of amino acid 

35 residues between cysteines (C) . 
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In another embodiment, the polynucleotide is 80 
or 90% identical to the amino acid sequence as shown in 
SEQ ID NO: 2 from residue 24 to residue 354. 

It is also the object of the present invention 
to provide an isolated polynucleotide acid molecule that 
encodes a connective tissue growth factor homolog 
polypeptide, wherein the polynucleotide molecule is 
selected from the group consisting of (a) a molecules 
having the nucleotide sequence of SEQ ID NO:l from 
nucleotide 17 or 86 to nucleotide 1078, (b) a molecule 
encoding the amino acid sequence of SEQ ID NO: 3 -from 
nucleotide 1 or 70 to nucleotide 1062, and (c) a molecule 
that hybridizes under stringent wash conditions to a 
polynucleotide molecule having the nucleotide sequence of 
nucleotides 86 to 1078 of SEQ ID NO:l, or the complement 
of nucleotides 86 to 1078 of SEQ ID N0:1. 

In another embodiment, the differences in the 
amino acid sequence encoded by the polynucleotide and SEQ 
ID NO: 2 are conservative amino acid changes. 

In other aspects, the present invention provides 
an expression vector comprising the following operably 
linked elements: a transcription promoter; a DNA segment 
comprising the isolated polynucleotide sequence that 
encodes a connective tissue growth factor homolog 
polypeptide that is at least 70% identical to the amino 
acid sequence as shown in SEQ ID NO: 2 from residue 24 to 
residue 354; and a transcription terminator and a cultured 
host cell into which has been introduced the expression 
vector. 

In another embodiment, the present invention 
provides a method of producing a connective tissue growth 
factor homolog polypeptide comprising: (a) culturing the 
host cells of expressing the CTGF homolog polypeptide; and 
(b) isolating the connective tissue growth factor homolog 
polypeptide from the cultured host cells. 

In another aspect, the present invention 
provides an isolated connective tissue growth factor 
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polypeptide comprising an amino acid sequence that is at 
least 70% identical to the amino acid sequence as shown in 
SEQ ID NO: 2 from residue 24 to residue 354. 

In other embodiments, the present invention 
5 provides CTGF homolog polypeptides wherein the amino acid 
sequence is at least 80% or 90% identical. 

In another embodiment, the CTGF homolog 
polypeptide molecule comprises a region having the 
following motif as shown in SEQ ID NO: 23: 
10 Cx{8 / 10}CxCCxxCx{7}Cx{5 / 6}Cx{5 / 7}Cx{l2 / 13}Cx{7 / 8}Cx{20} 

CxCx{6}Cx{l2 / 14}Cx{l3 / 17}C 

wherein x{ } is the number of amino acid 
residues between cysteines (C) . 

In another aspect, the present invention 
15 provides an antibody or antibody fragment that 
specifically binds with the CTGF homolog polypeptide. 

In another aspect, the present invention 
provides a method of detecting the presence of connective 
tissue growth factor homolog polypeptide in a biological 
20 sample, comprising the steps of: (a) contacting the 
biological sample with an antibody, or an antibody 
fragment of claim 14, wherein the contacting is performed 
under conditions that allow the binding of the antibody or 
antibody fragment to the biological sample, and (b) 
25 detecting any of the bound antibody or bound antibody 
fragment . 

In another aspect, the present invention 
provides an anti-idiotype antibody, or anti-idiotype 
antibody fragment, that specifically binds with the 
30 antibody or antibody fragment. 

In another aspect, the present invention 
provides a method of detecting a chromosome 6q abnormality 
in sample from an individual comprising: (a) obtaining 
ZCTGF4 RNA from the sample; (b) generating zCTGF4 cDNA by 
35 polymerase chain reaction; and (c) comparing the nucleic 
acid sequence of the zCTGF4 cDNA to the nucleic acid 
sequence as shown in SEQ ID NO: 1. 
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In another embodiment, the present invention 
provides that the difference between the sequence of the 
zCTGF4 cDNA or zCTGF4 gene in the sample and the zCTGF4 
sequence as shown in SEQ ID NO: 1 is indicative of 
5 chromosome 6q abnormality. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 illustrates a multiple alignment of 
human zCTGF4 (SEQ ID NO: 2) and other members of the 
10 connective tissue growth factor family that includes, 

mouse ZCTGF2 (SEQ ID NO: 5) , human NOV (NOV HU; SEQ ID 
NO: 25), human CTGF1 (CTGF H; SEQ ID NO: 26), human 
insulin binding protein 1 (IBP1 H; SEQ ID NO: 27) , and 
human insulin binding protein 2 (IBP2 H; SEQ ID NO: 28) . 

15 

DETAILED DESCRIPTION OF THE INVENTION 

Prior to setting forth the invention in detail, 
it may be helpful to the understanding thereof to define 
the following terms : 
20 The term "affinity tag" is used herein to denote 

a polypeptide segment that can be attached to a second 
polypeptide to provide for purification or detection of 
the second polypeptide or provide sites for attachment of 
the second polypeptide to a substrate. In principal, any 
25 peptide or protein for which an antibody or other specific 
binding agent is available can be used as an affinity tag. 
Affinity tags include a poly-histidine tract, protein A 
(Nilsson et al . , EMBO J . 4:1075, 1985; Nilsson et al . , 
Methods Enzvmol . 198:3, 1991), glutathione S transferase 
30 (Smith and Johnson, Gene 67:31, 1988), Glu-Glu affinity 

tag (Grussenmeyer et al . , Proc . Natl . Acad. Scd^ USA 

82.: 7952-4, 1985), substance P, Flag™ peptide (Hopp et al., 
Biotechnology 6:1204-10, 1988), streptavidin binding 
peptide, or other antigenic epitope or binding domain. 

3 5 See, in general, Ford et al . , Protein Expression — and 

Purification 2: 95-107, 1991. DNAs encoding affinity tags 
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are available from commercial suppliers (e.g., Pharmacia 
Biotech, Piscataway, NJ) . 

The term "allelic variant" is used herein to 
denote any of two or more alternative forms of a gene 
5 occupying the same chromosomal locus. Allelic variation 
arises naturally through mutation, and may result in 
phenotypic polymorphism within populations. Gene 
mutations can be silent (no change in the encoded 
polypeptide) or may encode polypeptides having altered 
10 amino acid sequence. The term allelic variant is also 
used herein to denote a protein encoded by an allelic 

variant of a gene. 

The terms " amino - t erminal " and " carboxyl - 
terminal" are used herein to denote positions within 
15 polypeptides. Where the context allows, these terms are 
used with reference to a particular sequence or portion of 
a polypeptide to denote proximity or relative position. 
For example, a certain sequence positioned carboxyl - 
terminal to a reference sequence within a polypeptide is 

2 0 located proximal to the carboxyl terminus of the reference 

sequence, but is not necessarily at the carboxyl terminus 
of the complete polypeptide. 

The term "complement/anti-complement pair" 
denotes non-identical moieties that form a non-covalently 
25 associated, stable pair under appropriate conditions. For 
instance, biotin and avidin (or streptavidin) are 
prototypical members of a complement / ant i - complement pair. 
Other exemplary complement/anti-complement pairs include 
receptor/ligand pairs, antibody /antigen (or hapten or 

3 0 epitope) pairs, sense/antisense polynucleotide pairs, and 

the like. Where subsequent dissociation of the 

complement / ant i- complement pair is desirable, the 
complement/anti- complement pair preferably has a binding 

affinity of <10 9 NT 1 . 
35 The term "complements of a polynucleotide 

molecule" denotes a polynucleotide molecule having a 
complementary base sequence and reverse orientation as 
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compared to a reference sequence. For example, the 
sequence 5 1 ATGCACGGG 3 1 is complementary to 5 1 CCCGTGCAT 
3 » . 

The term "contig" denotes a polynucleotide that 
5 has a contiguous stretch of identical or complementary 
sequence to another polynucleotide. Contiguous sequences 
are said to "overlap" a given stretch of polynucleotide 
sequence either in their entirety or along a partial 
stretch of the polynucleotide. 
10 The term "degenerate nucleotide sequence" 

denotes a sequence of nucleotides that includes one or 
more degenerate codons (as compared to a reference 
polynucleotide molecule that encodes a polypeptide) . 
Degenerate codons contain different triplets of 
15 nucleotides, but encode the same amino acid residue (i.e., 
GAU and GAG triplets each encode Asp) . 

The term "expression vector" is used to denote a 
DNA molecule, linear or circular, that comprises a segment 
encoding a polypeptide of interest operably linked to 
additional segments that provide for its transcription. 
Such additional segments include promoter and terminator 
sequences, and may also include one or more origins of 
replication, one or more selectable markers, an enhancer, 
a polyadenylation signal, etc. Expression vectors are 
25 generally derived from plasmid or viral DNA, or may 
contain elements of both. 

The term "isolated", when applied to a 
polynucleotide, denotes that the polynucleotide has been 
removed from its natural genetic milieu and is thus free 
3 0 of other extraneous or unwanted coding sequences, and is 
in a form suitable for use within genetically engineered 
protein production systems. Such isolated molecules are 
those that are separated from their natural environment 
and include cDNA and genomic clones. Isolated DNA 

3 5 molecules of the present invention are free of other genes 
with which they are ordinarily associated, but may include 
naturally occurring 5' and 3' untranslated regions such as 
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promoters and terminators. The • identification of 

associated regions will be evident to one of ordinary 
skill in the art (see for example, Dynan and Tijan, Nature 

316:774-78, 1985) . 
5 An "isolated" polypeptide or protein is a 

polypeptide or protein that is found in a condition other 
than its native environment, such as apart from blood and 
animal tissue. In a preferred form, the isolated 

polypeptide is substantially free of other polypeptides, 
10 particularly other polypeptides of animal origin. It is 
preferred to provide the polypeptides in a highly purified 
form, i.e. greater than 95% pure, more preferably greater 
than 99% pure. When used in this context, the term 
"isolated" does not exclude the presence of the same 
15 polypeptide in alternative physical forms, such as dimers 
or alternatively glycosylated or derivatized forms. 

The term "operably linked", when referring to 
DNA segments, indicates that the segments are arranged so 
that they function in concert for their intended purposes, 
20 e.g., transcription initiates in the promoter and proceeds 
through the coding segment to the terminator. 

The term "ortholog" denotes a polypeptide or 
protein obtained from one species that is the functional 
counterpart of a polypeptide or protein from a different 
25 species. Sequence differences among orthologs are the 
result of speciation. 

"Paralogs" are distinct but structurally related 
proteins made by an organism. Paralogs are believed to 
arise through gene duplication. For example, oc-globin, p- 
30 globin, and myoglobin are paralogs of each other. 

A "polynucleotide" is a single- or double- 
stranded polymer of deoxyribonucleotide or ribonucleotide 
bases read from the 5' to the 3' end. Polynucleotides 
include RNA and DNA, and may be isolated from natural 
35 sources, synthesized in vitro, or prepared from a 
combination of natural and synthetic molecules. Sizes of 
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polynucleotides are expressed as base pairs (abbreviated 
"bp")/ nucleotides ("nt"), or kilobases ("kb"). Where the 
context allows, the latter two terms may describe 
polynucleotides that are single -stranded or double- 
stranded. When the term is applied to double -stranded 
molecules it is used to denote overall length and will be 
understood to be equivalent to the term "base pairs" . It 
will be recognized by those skilled in the art that the 
two strands of a double -stranded polynucleotide may differ 
slightly in length and that the ends thereof may be 
staggered as a result of enzymatic cleavage; thus all 
nucleotides within a double -stranded polynucleotide 
molecule may not be paired. Such unpaired ends will in 
general not exceed 20 nt in length. 
15 A "polypeptide" is a polymer of amino acid 

residues joined by peptide bonds, whether produced 
naturally or synthetically. Polypeptides of less than 
about 10 amino acid residues are commonly referred to as 
"peptides" . 

20 The term "promoter" is used herein for its art- 

recognized meaning to denote a portion of a gene 
containing DNA sequences that provide for the binding of 
RNA polymerase and initiation of transcription. Promoter 
sequences are commonly, but not always, found in the 5' 

25 non-coding regions of genes. 

A "protein" is a macromolecule comprising one or 
more polypeptide chains. A protein may also comprise non- 
peptidic components, such as carbohydrate groups. 
Carbohydrates and other non-peptidic substituents may be 

30 added to a protein by the cell in which the protein is 
produced, and will vary with the type of cell. Proteins 
are defined herein in terms of their amino acid backbone 
structures; substituents such as carbohydrate groups are 
generally not specified, but may be present nonetheless. 

35 The term "receptor" denotes a cell-associated 

protein that binds to a bioactive molecule (i.e., a 
ligand) and mediates the effect of the ligand on the cell. 
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Membrane -bound receptors are characterized by a multi- 
peptide structure comprising an extracellular ligand- 
binding domain and an intracellular effector domain that 
is typically involved in signal transduction. Binding of 
5 ligand to receptor results in a conformational change in 
the receptor that causes an interaction between the 
effector domain and other molecule (s) in the cell. This 
interaction in turn leads to an alteration in the 
metabolism of the cell. Metabolic events that are linked 
10 to receptor^ ligand interactions include gene 

transcription, phosphorylation, dephosphorylation, 

increases in cyclic AMP production, mobilization of 
cellular calcium, mobilization of membrane lipids, cell 
adhesion, hydrolysis of inositol lipids and hydrolysis of 
15 phospholipids. In general, receptors can be membrane 
bound, cytosolic or nuclear; monomeric (e.g., thyroid 
stimulating hormone receptor, beta-adrenergic receptor) or 
multimeric (e.g., PDGF receptor, growth hormone receptor, 
IL-3 receptor, GM-CSF receptor, G-CSF receptor, 
20 erythropoietin receptor and IL-6 receptor) . 

The term "secretory signal sequence" denotes a 
DNA sequence that encodes a polypeptide (a "secretory 
peptide") that, as a component of a larger polypeptide, 
directs the larger polypeptide through a secretory pathway 
25 of a cell in which it is synthesized. The larger 
polypeptide is commonly cleaved to remove the secretory 
peptide during transit through the secretory pathway. 

The term "splice variant" is used herein to 
denote alternative forms of RNA transcribed from a gene. 
3 0 Splice variation arises naturally through use of 
alternative splicing sites within a " transcribed RNA 
molecule, or less commonly between separately transcribed 
RNA molecules, and may result in several mRNAs transcribed 
from the same gene. Splice variants may encode 

3 5 polypeptides having altered amino acid sequence. The term 
splice variant is also used herein to denote a protein 
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encoded by a splice variant of an mRNA transcribed from a 
gene . 

Molecular weights and lengths of polymers 
determined by imprecise analytical methods (e.g., gel 
5 electrophoresis) will be understood to be approximate 
values. When such a value is expressed as "about" X or 
"approximately" X, the stated value of X will be 
understood to be accurate to ±10%. 

All references cited herein are incorporated by 

* « 

10 reference in their entirety. 

The present invention is based in part upon the 
discovery of a novel DNA sequence that encodes a 
connective tissue growth factor homolog polypeptide, i.e., 
15 polypeptides having homology to other members of a family 
of growth factors that are secreted and contain a cysteine 
motif of the formula: 

Cx{8,10}CxCCxxCx{7}Cx{5 / 6}Cx{5,7}Cx{l2,13}Cx{7 / 8}Cx{20} 

CxCx{6}Cx{l2,14}Cx{l3,17}C 

wherein x{ } is the number of amino acid 
residues between cysteines (C) as shown in SEQ ID NO: 23. 
This motif represents a consecutive domain arrangement 
known as a insulin-like growth factor binding domain 
followed by a Von Willebrand Factor c (VWFc) domain (Bork, 
25 FEBS Letts . 327:125-130, 1993). The protein domain 
arrangement is unique to the CTGF/NOV protein structure 
family. The insulin-like growth factor binding domain is 

represented by the motif: 

Cx{8,10}CxCCxxCx{7}Cx{5,6}Cx{5,7} 

30 wherein x{ } is the number of amino acid 

residues between cysteines (C) as shown in SEQ ID NO: 24. 
This pattern is found in all known members of the 
connective tissue growth factors and insulin binding 
protein family (for example: human CTGF, human CTGF-2, 

35 human NOV, human IBPs 1 and 2), and is unique to these 
proteins . 
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Analysis of the tissue distribution of the mRNA 
corresponding to this novel DNA showed that expression was 
highest in testis, followed by apparent but decreased 
expression levels in trachea, bone marrow and kidney 
tissue. The connective tissue growth factor homolog 
polypeptide has been designated zCTGF4 . 

The ortholog from mouse has also been identified 
and designated zCTGF2 . A DNA sequence and corresponding 
putative amino acid sequence are shown in SEQ ID NOS : 4 
and 5, respectively. 

The novel zCTGF4 polypeptides of the present 
invention were initially identified by querying an EST 
database for homologous v sequences to connective tissue 
growth factor and insulin-binding proteins. A single EST 
sequence was discovered and predicted to be related to the 
connective tissue growth factor family. Sequence analysis 
of the clone from which the EST was derived revealed that 
the clone contained a defective cDNA with incorrect 
sequence. Isolation of an independent clone from a testis 
library revealed that the original cDNA clone contained 
intronic sequence at the 5' end of the EST. The 
nucleotide sequence is described in SEQ ID NO. 1 from 
nucleotide 17 to nucleotide 1078, and its deduced amino 
acid sequence is described in SEQ ID NO. 2. 

Analysis of the DNA encoding a zCTGF4 
polypeptide (SEQ ID NO: 1) revealed an open reading frame 
encoding 354 amino acids (SEQ ID NO: 2) comprising a 
secretory signal peptide of 23 amino acid residues 
(residue 1 (Met) to residue 23 (Gly) of SEQ ID NO: 2) and 
a mature polypeptide of 331 amino acids (residue 24 (Thr) 
to residue 354 (Leu) of SEQ ID NO: 2). An alternatively 
spliced variant has a 16 amino acid secretory signal 
peptide (residue 1 (Met) to residue 16 (Gly) of SEQ ID NO: 
2, and a mature polypeptide of 338 amino acid residues 
(residue 17 (Phe) to residue 354 (Leu) of SEQ ID NO: 2. 

CTGF family members are characterized by a 
multi-domain structure comprising a IBP domain (amino acid 
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residue 59 (Pro) to 102 (Tyr) of SEQ ID NO: 2) that has 
been suggested as an insulin growth factor binding domain 
(Kiefer et al . , J, Biol. Chem. 266 : 9043-9049 , 1991), a von 
Willebrand factor c domain (amino acid residue 114 (Cys) 
to 179 (Cys) of SEQ ID NO: 2) that may be involved in 
multimerization, a variable domain (amino acid 180 (Ser) 
to 208 (Lys) , that may be involved in tissue, matrix, 
growth factor or receptor specific interactions; and a 
sulfated glycoconjugate binding motif domain (amino acid 
residue 209 (Cys) to 252 (Cys) of SEQ ID NO: 2) that is 
thought to be involved in binding large macromolecules 
(Holt et al., J. Biol. Chem. 265 :2852-2855, 1990). 

CTGF has recently been demonstrated to bind IGF 
at low affinity (Kim et al., Proc. Nat. Acad. Sci. 
.94:12981-12986, 1997), and thereby expands the insulin- 
like growth factor binding protein (IGFBP) superfamily to 
include the CTGFs. The superfamily of IGFBPs now 
comprises both proteins that bind IGF high affinity (e.g., 
IGFBP 1-5) and proteins that bind IGF with low affinity 
(nov, cyr61 and CTGF) , suggesting proteins affect cell 
growth in both IGF-dependent and IGF -independent manners. 
The IGF binding potential of members of the CTGF family, 
including zCTGF4, may act as a competitive inhibitor of 
the biologically free component of IGF. ZCTGF4 may affect 
other pathways that IGFBPs play a role in, for example, 
competing with endogenous IGFBP for proteases that degrade 
the IGF/ IGFBP complex, resulting in changes in the 
circulating levels of IGF. 

A multiple alignment, as shown in Figure 1, also 
revealed that zCTGF4, like several other known members of 
the CTGF family, has a heparin binding domain that has 
been suggested as a receptor binding and dimerization 
domain. The heparin binding domain is shown in SEQ ID NO: 
2 from residue 262 (lie) to residue 295 (Phe) . The domain 
structure has been reviewed by Brigstock et al . , Ibid., 
1997 and Bork, FEBS Letts. 327 : 125-130, 1993, both 
incorporated herein by reference. 
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It is generally believed that under selective 
pressure for organisms to acquire new biological 
functions, new CTGF family members arose from duplication 
of existing genes leading to the existence of mult i -gene 
families. Family members thus contain vestiges of the 
ancestral gene, and these characteristic features can be 
exploited in the isolation and identification of 
additional family members. It has recently been 

demonstrated that N- terminally truncated CTGF molecules 
isolated from ' uterine secretory fluids have mitogenic 
activity and will bind heparin (Brigstock et al . , ibid., 
1991), and may have activity. An N-terminally truncated 
z CTGF 4 molecule would be expected to have similar 
activity, and would comprise molecules without the IBP 
domain and may or may not be C-terminally truncated as 
well . 

SEQ ID NO: 3 is a degenerate polynucleotide 
sequence that encompasses all polynucleotides that could 
encode the zCTGF4 polypeptide of SEQ ID NO: 2 (amino acids 
1 or 24 to 354) . Thus, zCTGF4 polypeptide -encoding 
polynucleotides ranging from nucleotide 17 or 85 to 
nucleotide 1078 of SEQ ID NO: 2 or nucleotide 1 or 6 9 to 
1062 of SEQ ID NO: 3 are contemplated by the present 
invention. Also contemplated by the present invention are 
fragments and fusions as described herein with respect to 
SEQ ID NO: 1, which are formed from analogous regions of 
SEQ ID NO: 3, wherein nucleotides 191 to 322 of SEQ ID NO: 
1 correspond to nucleotides 175 to 3 06 of SEQ ID NO: 3, 
for the IBP domain; wherein nucleotides 3 56 to 553 of SEQ 
ID NO: 1 correspond to nucleotides 340 to 53 7 of SEQ ID 
NO: 3, for the von Willebrand factor (VWFc) domain; 
wherein nucleotides 554 to 640 of SEQ ID NO : 1 correspond 
to nucleotides 538 to 624 of SEQ ID NO : 3 for the variable 
region; wherein nucleotides 641 to 772 of SEQ ID NO: 1 
correspond to nucleotides 625 to 756 of SEQ ID NO : 3, for 
the sulfated glycocon jugate binding domain; and wherein 
nucleotide 800 to nucleotide 904 of SEQ ID NO: 1 
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correspond to nucleotide 784 to nucleotide 888 of SEQ ID 
NO: 3 for the heparin binding domain. Table 1 sets forth 
the one- letter codes used within SEQ ID NO: 3 to denote 
degenerate nucleotide positions. "Resolutions" are the 
nucleotides denoted by a code letter. "Complement" 
indicates the code for the complementary nucleotide (s) . 
For example, the code Y denotes either C or T, and its 
complement R denotes A or G, A being complementary to T, 
and G being complementary to C. 
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TABLE 1 



Nucleotide Resolution Complement Resolution 



A 


A 


T 


T 


C 


C 


G 


G 


G 


6 


C 


C 


T • 


T 


A 


A 


R , 


A|G 


Y 


C|T 


Y 


C|T 


R 


A|G 


= . M 


AIC 


K 


GIT 


9 K 


G|T 


M 


A|C 


KB! :. ^ 


C|G 


S 


C|G 


bk? :■ 

II w 


A|T 


W 


A|T 


^ j_j 
• II 

ass; i 


A|C|T 


D 


A|G|T 


B 


C|G|T 


V 


A|C|G 


V 


A|C|G 


B 


C|G|T 


?! D 


A|G|T 


H 


A j C | T 


..... 

v 


A } C J G [ T 


N 


A|C|G|T 
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The degenerate codons used in SEQ ID NOS : 2 and 
5, encompassing all possible codons for a given amino 
acid, are set forth in Table 2. 



TABLE 2 





One 






Amino 


Letter 


Codons 


Degenerate 


Acid 


Code 




Codon 


Cys 


C 


TGC TGT 


TGY 


Ser 


S 


AGC AGT TCA TCC TCG TCT 


WSN 


Thr 


T 


ACA ACC ACG ACT 


ACN 


Pro 


P • 


CCA CCC CCG CCT 


CCN 


Ala 


A 


GCA GCC GCG GCT 


GCN 


Gly 


G 


GGA GGC GGG GGT 


GGN 


Asn 


N 


AAC AAT 


AAY 


Asd 


D 


GAC GAT 


GAY 


Glu 


E 


GAA GAG 


GAR 


Gin 


Q 


CAA CAG 


CAR 


His 


H 


CAC CAT 


CAY 


Arn 


R 


AGA AGG CGA CGC CGG CGT 


MGN 


Lys 


K 


AAA AAG 


AAR 


Met 


M 


ATG 


ATG 


He 


I 


ATA ATC ATT 


ATH 


! PU 

i — V— m 


L 


CTA CTC CTG CTT TTA 1 IG 


YTN 


Val 


V 


GTA GTC GTG GTT 


GTN 


Phe 


F 


TTC 1 1 1 


TTY 


Tvp 


Y 


TAC TAT 


TAY 


Trp 


w 


TGG 


TGG 


Ter 


« 


TAA TAG TGA 


TRR 


Asn | Asp 


B 




RAY 


Glu|Gln 


Z 




SAR 


Any 


X 




NNN 
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One of ordinary skill in the art will appreciate 
that some ambiguity is introduced in determining a 
degenerate codon, representative of all possible codons 
encoding each amino acid. For example, the degenerate 
5 codon for serine (WSN) can, in some circumstances, encode 
arginine (AGR) , and the degenerate codon for arginine 
(MGN) can, in some circumstances, encode serine (AGY) . A 
similar relationship exists between codons encoding 
phenylalanine and leucine. Thus, some polynucleotides 
10 encompassed by the degenerate sequence may encode variant 
amino acid sequences, but one of ordinary skill in the art 
can easily identify such variant sequences by reference to 
the amino acid sequence of SEQ ID NOS : 2 and 5. Variant 
sequences can be readily tested for functionality as 

15 described herein. 

One of ordinary skill in the art will also 
appreciate that different species can exhibit 
"preferential codon usage." In general, see, Grantham, et 
al., Nuc . Acids Res . , 8:1893-912, 1980; Haas, et al . Curr. 

20 Biol . , 6:315-24, 1996; Wain-Hobson, et al . , Gene , 13.: 355- 
64, 1981; Grosjean and Fiers, Gene , 18:199-209, 1982; 
Holm, Nuc - Acids Res . , 14:3075-87, 1986; Ikemura, J ■ Mol . 
Biol . , 158:573-97, 1982. As used herein, the term 

"preferential codon usage" or "preferential codons" is a 

25 term of art referring to protein translation codons that 
are most frequently used in cells of a certain species, 
thus favoring one or a few representatives of the possible 
codons encoding each amino acid (See Table 2) . For 
example, the amino acid Threonine (Thr) may be encoded by 

30 ACA, ACC, ACG, or ACT, but in mammalian cells ACC is the 
most commonly used codon; in other species, for example, 
insect cells, yeast, viruses or bacteria, different Thr 
codons may be preferential. Preferential codons for a 
particular species can be introduced into the 
35 polynucleotides of the present invention by a variety of 
methods known in the art. Introduction of preferential 
codon sequences into recombinant DNA can, for example, 
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enhance production of the protein by making protein 
translation more efficient within a particular cell type 
or species- Therefore, the degenerate codon sequence 
disclosed in SEQ ID NO: 3 serves as a template for 
optimizing expression of polynucleotides in various cell 
types and species commonly used in the art and disclosed 
herein. Sequences containing preferential codons can be 
tested and optimized for expression in various species, 
and tested for functionality as disclosed herein. 

The present invention further provides variant 
polypeptides and nucleic acid molecules that represent 
counterparts from other species (orthologs) . These 
species include, but are not limited to mammalian, avian, 
amphibian, reptile, fish, insect and other vertebrate and 
invertebrate species. Of particular interest are zCTGF4 
polypeptides from other mammalian species, including 
murine, porcine, ovine, bovine, canine, feline, equine, 
and other primate polypeptides. Orthologs of human zCTGF4 
can be cloned using information and compositions provided 
by the present invention in combination with conventional 
cloning techniques. For example, a cDNA can be cloned 
using mRNA obtained from a tissue or cell type that 
expresses zCTGF4 as disclosed herein. Suitable sources of 
mRNA can be identified by probing northern blots with 
probes designed from the sequences disclosed herein. A 
library is then prepared from mRNA of a positive tissue or 
cell line. The mouse sequence ZCTGF2 is a representative 
ortholog of the human connective tissue growth factor, 
ZCTGF4, and is disclosed herein as SEQ ID NOS : 4 and 5. 

An zCTGF4 -encoding cDNA can then be isolated by 
a variety of methods, such as by probing with a complete 
or partial human cDNA or with one or more sets of 
degenerate probes based on the disclosed sequences. A 
cDNA can also be cloned using the polymerase chain 
reaction with primers designed from the representative 
human zCTGF4 sequences disclosed herein. Within an 

additional method, the cDNA library can be used to 
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transform or transfect host cells, and expression of the 
cDNA of interest can be detected with an antibody to 
zCTGF4 polypeptide. Similar techniques can also be 

applied to the isolation of genomic clones. 
5 The present invention provides polynucleotide 

molecules including DNA and RNA molecules that encode the 
zCTGF4 polypeptides disclosed above . 

ZCTGF4 polynucleotide sequences disclosed herein 
can also be used as probes or primers to clone 5 1 non- 

10 coding regions of a zCTGF4 gene. In view of the tissue- 
specific expression observed for zCTGF4 by Northern 
blotting, this gene region is expected to provide for 
testis-, trachea-, bone marrow- , and kidney-specific 
expression. Promoter elements from a zCTGF4 gene could 

15 thus be used to direct the tissue-specific expression of 
heterologous genes in, for example, transgenic animals or 
patients treated with gene therapy. Cloning of 5 r 
flanking sequences also facilitates production of zCTGF4 
proteins by "gene activation 11 as disclosed in U.S. Patent 

20 No. 5,641,670, Briefly, expression of an endogenous 
ZCTGF4 gene in a cell is altered by introducing into the 
ZCTGF4 locus a DNA construct comprising at least a 
targeting sequence, a regulatory sequence, an exon, and an 
impaired splice donor site. The targeting sequence is a 

25 zCTGF4 5 1 non-coding sequence that permits homologous 
recombination of the construct with the endogenous zCTGF4 
locus, whereby the sequences within the construct become 
operably linked with the endogenous zCTGF4 coding 
sequence. In this way, an endogenous zCTGF4 promoter can 

3 0 be replaced or supplemented with other regulatory 
sequences to provide enhanced, tissue-specific, or 
otherwise regulated expression. 

Those skilled in the art will recognize that the 
sequence disclosed in SEQ ID NO:l represents a single 

35 allele of human zCTGF4 and that allelic variation and 
alternative splicing are expected to occur. Allelic 
variants of this sequence can be cloned by probing cDNA or 
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genomic libraries from different individuals according to 
standard procedures. Allelic variants of the nucleotide 
sequence shown in SEQ ID NO:l, including those containing 
silent mutations and those in which mutations result in 
5 amino acid sequence changes, are within the scope of the 
present invention, as are proteins which are allelic 
variants of SEQ ID NO: 2. cDNA molecules generated from 
alternatively spliced mRNAs, which retain the properties 
of the zCTGF4 polypeptide are included within the scope of 

10 the present invention, as are polypeptides encoded by such 
cDNAs and mRNAs. Allelic variants and splice variants of 
these sequences can be cloned by probing cDNA or genomic 
libraries from different individuals or tissues according 
to standard procedures known in the art . 

15 The present invention provides methods for using 

ZCTGF4 polynucleotides and polypeptides to diagnose 
chromosomal disorders associated with abnormal expression 
of the zCTGF4 protein. Detectable chromosomal mutations 
at the zCTGF4 gene locus include, but are not limited to, 

20 aneuploidy, gene copy number changes, insertions, 
deletions, restriction site changes and rearrangements. 
Such aberrations can be identified by employing molecular 
genetic techniques, such as restriction fragment length 
polymorphism (RFLP) analysis, short tandem repeat (STR) 

25 analysis employing PCR techniques, and other genetic 
linkage analysis techniques known in the art (Molecular 
Cloning: A Laboratory Manual , 2nd ed., Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY, 19 89, and 
Ausubel et al . , eds . , Current Protocols in Molecular 

30 Biology , John Wiley and Sons, Inc., NY, 1987; A.J, Marian, 
Chest 108 :255-65, 1995) . Analyses of DNA samples can 
detect deletions and insertions by changes in size in 
amplified DNA products by comparing a sample DNA to, a 
normal zCTGF4 DNA standard. Mismatches in duplex DNA can 

35 be detected by RNase digestion or differences in melting 
temperature. Other methods for detecting differences in 
sequences include, changes electrophoretic motility, 
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Southern analysis, and direct DNA sequencing. Recently, 
techniques for accessing genetic information with high- 
density arrays have been available (Chee et al . , Science 
274 : 610-614 , 1996), and can analyze large fragments of 
5 genomic DNA with high resolution. 

Analysis of chromosomal DNA using the zCTGF4 
polynucleotide sequence is useful for correlating disease 
with abnormalities localized to chromosome 6. The zCTGF4 
gene has been localized to chromosome 6q22.1, Studies of 

10 the DNA sequences, cDNA and/or genomic DNA, of some of the 
individuals presenting disease that contain a mutation in 
the sequence of the ZCTGF4 gene, that is not present in 
normal individuals, can provide strong evidence for the 
mutation as causative of the disease. In one embodiment, 

15 the methods of the present invention provide a method of 
detecting a chromosome 6q abnormality in sample from an 
individual comprising: (a) obtaining zCTGF4 RNA from the 
sample; (b) generating ZCTGF4 cDNA by polymerase chain 
reaction; and (c) comparing the nucleic acid sequence of 

20 the ZCTGF4 cDNA to the nucleic acid sequence as shown in 
SEQ ID NO: 1. In further embodiments, the difference 
between the sequence of the zCTGF4 cDNA or zCTGF4 gene in 
the sample and the zCTGF4 sequence as shown in SEQ ID NO: 
1 is indicative of chromosome 6q abnormality. 

25 Within preferred embodiments of the invention, 

the isolated nucleic acid molecules can hybridize under 
stringent conditions to nucleic acid molecules having the 
nucleotide sequence of SEQ ID NO:l, to nucleic acid 
molecules having the nucleotide sequence of nucleotides 86 

3 0 to 1078 of SEQ ID NO:l, or to nucleic acid molecules 
having a nucleotide sequence complementary to SEQ ID NO:l. 
In general, stringent conditions are selected to be about 
5°C lower than the thermal melting point (T m ) for the 
specific sequence at a defined ionic strength and pH. The 

35 T m is the temperature (under defined ionic strength and 
pH) at which 50% of the target sequence hybridizes to a 
perfectly matched probe. 



A pair of nucleic acid molecules, such as DNA- 
DNA, RNA-RNA and DNA-RNA, can hybridize if the nucleotide 
sequences have some degree of complementarity. Hybrids can 
tolerate mismatched base pairs in the double helix, but 
the stability of the hybrid is influenced by the degree of 
mismatch. The T m of the mismatched hybrid decreases by l°C 
for every 1-1.5% base pair mismatch. Varying the 

stringency of the hybridization conditions allows control 
over the degree of mismatch that will be present in the 
hybrid. The degree of stringency increases as the 
hybridization temperature increases and the ionic strength 
of the hybridization buffer decreases. Stringent 
hybridization conditions encompass temperatures of about 
5-25°C below the T m of the hybrid and a hybridization 
buffer having up to 1 M Na + . Higher degrees of stringency 
at lower temperatures can be achieved with the addition of 
formamide which reduces the T^ of the hybrid about 1°C for 
each 1% formamide in the buffer solution. Generally, such 
stringent conditions include temperatures of 20-70 °C and a 
hybridization buffer containing up to 6x SSC and 0-50% 
formamide. A higher degree of stringency can be achieved 
at temperatures of from 40-70°C with a hybridization 
buffer having up to 4x SSC and from 0-50% formamide. 
Highly stringent conditions typically encompass 
temperatures of 42-70°C with a hybridization buffer having 
up to Ix SSC and 0-50% formamide. Different degrees of 
stringency can be used during hybridization and washing to 
achieve maximum specific binding to the target sequence. 
Typically, the washes following hybridization are 
performed at increasing degrees of stringency to remove 
non-hybridized polynucleotide probes from hybridized 
complexes . 

The above conditions are meant to serve as a 
guide and it is well within the abilities of one skilled 
in the art to adapt these conditions for use with a 
particular polypeptide hybrid. The T m for a specific 
target sequence is the temperature (under defined 
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conditions) at which 50% of the target sequence will 
hybridize to a perfectly matched probe sequence. Those 
conditions which influence the T m include, the size and 
base pair content of the polynucleotide probe, the ionic 
strength of the hybridization solution, and the presence 
of destabilizing agents in the hybridization solution. 
Numerous equations for calculating T ra are known in the 
art, and are specific for DNA, RNA and DNA-RNA hybrids and 
polynucleotide probe sequences of varying length (see, for 
example, Sambrook et al,, Molecular Cloning: A Laboratory 
Manual, Second Edition (Cold Spring Harbor Press 1989) ; 
Ausubel ,et al . , (eds.), Current Protocols in Molecular 
Biology (John Wiley and Sons, Inc. 1987); Berger and 
Kimmel (eds.), Guide to Molecular Cloning Techniques, 
(Academic Press, Inc. 1987); and Wetmur, Crit. Rev. 
Biochem. Mol. Biol. 26:221 (1990)). Sequence analysis 
software such as OLIGO 6.0 (LSR; Long Lake, MN) and Primer 
Premier 4.0 (Premier Biosoft International; Palo Alto, 
CA) , as well as sites on the Internet, are available tools 
for analyzing a given sequence and calculating T m based on 
user defined criteria. Such programs can also analyze a 
given sequence under defined conditions and identify 
suitable probe sequences. Typically, hybridization of 
longer polynucleotide sequences, >50 base pairs, is 
performed at temperatures of about 20-25°C below the 
calculated T m . For smaller probes, <50 base pairs, 
hybridization is typically carried out at the T m or 5-10°C 
below. This allows for the maximum rate of hybridization 
for DNA -DNA and DNA-RNA hybrids. 

The length of the polynucleotide sequence 
influences the rate and stability of hybrid formation. 
Smaller probe sequences, <50 base pairs, reach equilibrium 
with complementary sequences rapidly, but may form less 
stable hybrids. Incubation times of anywhere from minutes 
to hours can be used to achieve hybrid formation. Longer 
probe sequences come to equilibrium more slowly, but form 
more stable complexes even at lower temperatures . 
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Incubations are allowed to proceed overnight or longer. 
Generally, incubations are carried out for a period equal 
to three times the calculated Cot time. Cot time, the 
time it takes for the polynucleotide sequences to 
5 reassociate, can be calculated for a particular sequence 
by methods known in the art . 

The base pair composition of polynucleotide 
sequence will effect the thermal stability of the hybrid 
complex, thereby influencing the choice of hybridization 

10 temperature and the ionic strength of the hybridization 
buffer. A-T pairs are less stable than G-C pairs in 
aqueous solutions containing sodium chloride. Therefore, 
the higher the G-C content, the more stable the hybrid. 
Even distribution of G and C residues within the sequence 

15 also contribute positively to hybrid stability. In 
addition, the base pair composition can be manipulated to 
alter the T m of a given sequence. For example, 5- 
methyldeoxycytidine can be substituted for deoxycytidine 
and 5-bromodeoxuridine can be substituted for thymidine to 

20 increase the T m# whereas 7-deazz-2 1 -deoxyguanosine can be 
substituted for guanosine to reduce dependence on T m . 

The ionic concentration of the hybridization 
buffer also affects the stability of the hybrid. 
Hybridization buffers generally contain blocking agents 

25 such as Denhardt 1 s solution (Sigma Chemical Co., St. 
Louis, Mo.), denatured salmon sperm DNA, tRNA, milk 
powders (BLOTTO) , heparin or SDS, and a Na + source, such as 
SSC (lx SSC: 0.15 M sodium chloride, 15 mM sodium citrate) 
or SSPE (lx SSPE: 1.8 M NaCl , 10 mM NaH 2 P0 4 , 1 mM EDTA, pH 

30 7.7). By decreasing the ionic concentration of the 
buffer, the stability of the hybrid is increased. 
Typically, hybridization buffers contain from between 10 
mM - 1 M Na + . The addition of destabilizing or denaturing 
agents such as formamide, tetralkyl ammonium salts, 

35 guanidinium cations or thiocyanate cations to the 
hybridization solution will alter the T m of a hybrid. 
Typically, formamide is used at a concentration of up to 



50% to allow incubations to be carried out at more 
convenient and lower temperatures. Formamide also acts to 
reduce non-specific background when using RNA probes. 

As an illustration, a nucleic acid molecule 
encoding a variant zCTGF4 polypeptide can be hybridized 
with a nucleic acid molecule having the nucleotide 
sequence of SEQ ID N0:1 (or its complement) at 42 °C 
overnight in a solution comprising 50% formamide, 5xSSC 
(lxSSC: 0.15 M sodium chloride and 15 mM sodium citrate), 
50 mM sodium phosphate (pH 7.6), 5x Denhardt f s solution 
(lOOx Denhardt 1 s solution: 2% (w/v) Ficoll 400, 2% -(w/v) 
polyvinylpyrrolidone, and 2% (w/v) bovine serum albumin) , 
10% dextran sulfate, and 2 0 jag/ml denatured, sheared 
salmon sperm DNA. One of skill in the art can devise 
variations of these hybridization conditions. For 
example, the hybridization mixture can be incubated at a 
higher temperature, such as about 6 5°C, in a solution that 
does not contain formamide. Moreover, premixed 

hybridization solutions are available (e.g., EXPRESSHYB 
Hybridization Solution from CLONTECH Laboratories, Inc.), 
and hybridization can be performed according to the 
manufacturer 1 s instructions . 

Following hybridization, the nucleic acid 
molecules can be washed to remove non-hybridized nucleic 
acid molecules under stringent conditions, or under highly 
stringent conditions. Typical stringent washing 

conditions include washing in a solution of 0 . 5x - 2x SSC 
with 0.1% sodium dodecyl sulfate (SDS) at 55 - 65°C. That 
is, nucleic acid molecules encoding a variant zCTGF4 
polypeptide hybridize with a nucleic acid molecule having 
the nucleotide sequence of SEQ ID N0:1 (or its complement) 
under stringent washing conditions,' in which the wash 
stringency is equivalent to 0.5x - 2x SSC with 0,1% SDS at 
55 - 65°C, including 0 . 5x SSC with 0.1% SDS at 55°C, or 
2xSSC with 0.1% SDS at 65°C. One of skill in the art can 
readily devise equivalent conditions, for example, by 
substituting SSPE for SSC in the wash solution. 
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Typical highly stringent washing conditions 
include washing in a solution of 0 . lx - 0 . 2x SSC with 0.1% 
sodium dodecyl sulfate (SDS) at 50 - 65 °C. In other 
words, nucleic acid molecules encoding a variant zCTGF4 
5 polypeptide hybridize with a nucleic acid molecule having 
the nucleotide sequence of SEQ ID N0:1 (or its complement) 
under highly stringent washing conditions, in which the 
wash stringency is equivalent to 0 . lx - 0.2x SSC with 0.1% 
SDS at 50 - 65°C, including O.lx SSC with 0.1% SDS at 

* 

10 50°C, or 0.2xSSC with 0.1% SDS at 65°C. 

The present invention also provides isolated 
zCTGF4 polypeptides that have a substantially similar 
M sequence identity to the polypeptides of SEQ ID NO: 2, or 

their orthologs. The term "substantially similar sequence 
y[ 15 identity" is used herein to denote polypeptides comprising 
N* at least 70%, at least 80%, at least 90%, at least 95% or 

ffi greater than 95% sequence identity to the sequences shown 

yp in SEQ ID NO: 2, or their orthologs. The present invention 

!\ also includes polypeptides that comprise an amino acid 

N; 20 sequence having at least 70%, at least 80%, at least 90%, 
J^j at least 95% or greater than 95% sequence identity to the 

r;i sequence of amino acid residues 1 or 24 to 354 of SEQ ID 

N ; NO: 2. The present invention further includes nucleic acid 

molecules that encode such polypeptides. Methods for 
25 determining percent identity are described below. 

The present invention also contemplates zCTGF4 
variant nucleic acid molecules that can be identified 
using two criteria: a determination of the similarity 
between the encoded polypeptide with the amino acid 
30 sequence of SEQ ID NO: 2, and/or a hybridization assay, as 
described above. Such zCTGF4 variants include nucleic 
acid molecules (1) that hybridize with a nucleic acid 
molecule having the nucleotide sequence of SEQ ID NO:l (or 
its complement) under stringent washing conditions, in 
35 which the wash stringency is equivalent to 0 . 5x - 2x SSC 
with 0.1% SDS at 55 - 6 5 °C, or (2) that encode a 
polypeptide having at least 70%, at least 80%, at least 
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90%, at least 95% or greater than 95% sequence identity to 

the amino acid sequence of SEQ ID NO: 2. Alternatively, 

zCTGF4 variants can be characterized as nucleic acid 

molecules (1) that hybridize with a nucleic acid molecule 

having the nucleotide sequence of SEQ ID NO:l (or its 

complement) under highly stringent washing conditions, in 

which the wash stringency is equivalent to 0 . lx - 0 . 2x SSC 

with 0.1% SDS at 50 - 65°C, and (2) that encode a 

polypeptide having at least 70%, at least 80%, at least 

90%, at least 95% or greater than 95% sequence identity to 

the amino acid sequence of SEQ ID N0:2. 

Percent sequence identity is determined by 

conventional methods. See, for example, Altschul et al . , 

Bull. Math. Bio. 48:603 (1986), and Henikoff and Henikoff, 

Proc. Natl. Acad. Sci. USA 85:10915 (1992) . Briefly, two 

amino acid sequences are aligned to optimize the alignment 

scores using a gap opening penalty of 10, a gap extension 

penalty of 1, and the "BLOSUM62 " scoring matrix of 

Henikoff and Henikoff (ibid.) as shown in Table 3 (amino 

acids are indicated by the standard one -letter codes) . 

Total number of identical matches 

x 100 

[length of the longer sequence plus the 

number of gaps introduced into the longer 

sequence in order to align the two sequences] 



> 

CO 
En 

H 

CD 

o 
a 
p 

pi 



LD 



H 
l 



CM 
l 



CN 
I 

CN 



cO 
I 

oo 
i 

m 
i 



in 



m 
i 



in 
cm 



i 

CM 



O 
H 



H 
i 



o 



CM 
I 

CM 
l 

I 

rH 
t 



CN 
1 



00 



CN 
! 



CO 

I 

H 
I 



CM 
I 



CO 

t 

1 

CO 

i 

CO 
I 

H 
I 

co 
i 

m 
i 

co 
i 

rH 
l 



CM 



00 

I 

00 

I 

CM 
1 

H 
i 

i 

00 

I 

CM 
I 



in 



CM 

i 

CO 



H 
i 

CM 
t 

H 
rH 



CO 
1 

I 



^0 



i 



CM 



rH 
I 



in 

H 
I 

CM 

H 

CM 
I 

00 

I 

CM 
O 



H 
I 

CO 
I 

CM 
I 

H 
1 

rH 
I 



00 

I 

o 
o 



rH 
I 

00 

I 

00 

1 

00 

I 

CM 
I 

CO 

t 

00 

t 

00 
I 

CM 
I 



I 

CM 
I 

rH 
I 

00 
1 

00 

I 

CM 
i 

CM 
I 

H 
l 

rH 
I 

00 

I 

rH 

i 

CM 
i 

CM 
I 

rH 
I 



rH 
I 

CM 
1 

H 
I 



rH 
H 

in cm 



oo 
i 



CM 
I 

CM 
I 

rH 
I 



rH 
I 

O 

rH 

rH 
I 

rH 



H 
t 

CM 
I 

H 
I 

H 
i 

H 
i 

rH 
1 

CM 
I 

CM 
1 

rH 
1 

rH 
1 

rH 
I 

rH 
1 



I 



CM 



CM 
l" 

CM 
1 

00 
I 

00 



H 
I 

00 



o 



C^ 
1 

CM 
1 

H 
i 



H 
i 

00 
I 

CM 
l 

OO 

I 

CM 
I 

CM 

I 

00 
I 

CM 
1 

CM 



rH 

I 



I 

I 

00 
I 

CO 

1 



H 
i 

CM 
I 

rH 

I 

rH 
I 

CM 



00 

I 

CM 
I 

rH 
I 

CM 
I 

OO 

1 

CM 
I 

CM 
I 

CM 
1 



CM 
I 



00 



00 

I 

00 

1 

CM 
1 

CM 
t 

rH 
I 

00 
I 

00 
I 

00 
I 



o 

> 



If) 



o 

rH 



in 

rH 



o 

CM 



30 



Those skilled in the art appreciate that there 
are many established algorithms available to align two 
amino acid sequences. The " FAST A" similarity search 
algorithm of Pearson and Lipman is a suitable protein 
5 alignment method for examining the level of identity 
shared by an amino acid sequence disclosed herein and the 
amino acid sequence of a putative variant zCTGF4 . The 
FASTA algorithm is described by Pearson and Lipman, Proc. 
Nat'l Acad. Sci,. USA 85:2444 (1988), and by Pearson, Meth. 

10 Enzymol. 283:63 (1990). 

Briefly, FASTA first characterizes sequence 
similarity by identifying regions shared by the query 
sequence (e.g., SEQ ID N0:2) and a test sequence that have 
either the highest density of identities (if the ktup 

15 variable is 1) or pairs of identities (if ktup=2) , without 
considering conservative amino acid substitutions, 
insertions, or deletions. The ten regions with the 
highest density of identities are then rescored by 
comparing the similarity of all paired amino acids using 

20 an amino acid substitution matrix, and the ends of the 
regions are "trimmed" to include only those residues that 
contribute to the highest score. If there are several 
regions with scores greater than the "cutoff" value 
(calculated by a predetermined formula based upon the 

25 length of the sequence and the ktup value) , then the 
trimmed initial regions are examined to determine whether 
the regions can be joined to form an approximate alignment 
with gaps. Finally, the highest scoring regions of the 
two amino acid sequences are aligned using a modification 

30 of the Needleman-Wunsch-Sellers algorithm (Needleman and 
Wunsch, J. Mol. Biol. 48:444 (1970); Sellers, SIAM J. 
Appl. Math. 26:787 (1974)), which allows for amino acid 
insertions and deletions. Illustrative parameters for 
FASTA analysis are: ktup=l, gap opening penalty=10, gap 

35 extension penalty=l, and substitution matrix=BLOSUM62 . 
These parameters can be introduced into a FASTA program by 
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modifying the scoring matrix file ( "SMATRIX" ) , as 
explained in Appendix 2 of Pearson, Meth. Enzymol. 183:63 
(1990) . 

FAST A can also be used to determine the- sequence 
5 identity of nucleic acid molecules using a ratio as 
disclosed above. For nucleotide sequence comparisons, the 
ktup value can range between one to six, preferably from 
four to six. 

Variant zCTGF4 polypeptides or polypeptides with 

10 substantially similar sequence identity are characterized 
as having one or more amino acid substitutions, deletions 
or additions. These changes are preferably of a minor 
nature, that is conservative amino acid substitutions (see 
Table 4) and other substitutions that do not significantly 

15 affect the folding or activity of the polypeptide; small 
deletions, typically of one to about 30 amino acids; and 
amino- or carboxyl -terminal extensions, such as an amino- 
terminal methionine residue, a small linker peptide of up 
to about 20-25 residues, or an affinity tag. The present 

20 invention thus includes polypeptides of from about 2 8 to 
3 54 amino acid residues that comprise a sequence that is 
at least 70%, preferably at least 90%, and more preferably 
95% or more identical to the corresponding region of SEQ 
ID NO: 2. In particular, peptides and polypeptides 

25 corresponding to domains of the zCTGF4 molecules as shown 
in SEQ ID NO: 2 include the IBP domain (residues 59-102) , 
the von Willebrand factor C domain (residues 114-179) , the 
variable domain (residues 180-208) and the glycocon jugate 
binding domains (residues 209-252) are within the scope of 

3 0 the present invention. Polypeptides comprising affinity 
tags can further comprise a proteolytic cleavage site 
between the zCTGF4 polypeptide- and the affinity tag. 
Preferred such sites include thrombin cleavage sites and 
factor Xa cleavage sites. 

35 
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Table 4 



Conservative amino acid substitutions 



Basic : 



• * 



gag* 



■ 



10 



15 



20 



Acidic : 



Polar: 



Hydrophobic 



Aromatic : 



Small : 



argmme 
lysine 
histidine 
glutamic acid 
aspartic acid 
glut amine 
asparagine 
leucine 
isoleucine 
valine 

phenylalanine 

tryptophan 

tyrosine 

glycine 

alanine 

serine 

threonine 

methionine 



25 The proteins of the present invention can also 

comprise non-naturally occurring amino acid residues. 
Non-naturally occurring amino acids include, without 
limitation, trans- 3 -me thylproline , 2, 4-methanoproline, 
cis-4 -hydroxyproline , trans- A -hydroxyproline , N- 

30 methylglycine, a21o-threonine, methyl threonine, 

hydroxyethylcysteine , hydroxyethylhomocysteine , 

nitroglutamine , homoglutamine, pipecolic acid, 

thiazolidine carboxylic acid, dehydroproline, 3- and 4- 
me thylproline, 3 , 3-dimethylproline, tert- leucine, 

3 5 norvaline, 2-azaphenylalanine, 3 -azaphenylalanine , 4- 
azaphenylalanine, and 4-f luorophenylalanine . Several 



methods are known in the art for incorporating non- 
naturally occurring amino acid residues into proteins. 
For example, an in vitro system can be employed wherein 
nonsense mutations are suppressed using chemically 
aminoacylated suppressor tRNAs . Methods for synthesizing 
amino acids and aminoacylating tRNA are known in the art. 
Transcription and translation of plasmids containing 
nonsense mutations is typically carried out in a cell- free 
system comprising an E. coli S3 0 extract and commercially 
available enzymes and other reagents. Proteins are 
purified by chromatography. See, for example, Robertson 
et al., J. Am. Chem. Soc. 113:2722 (1991), Ellman et al . , 
Methods Enzymol . 202:301 (1991), Chung et al . , Science 
259:806 (1993), and Chung et al . , Proc. Nat'l Acad. Sci . 
USA 50:10145 (1993) . 

In a second method, translation is carried out 
in Xenopus oocytes by microinjection of mutated mRNA and 
chemically aminoacylated suppressor tRNAs (Turcatti et 
al., J. Biol. Chem. 271:19991 (1996)). Within a third 
method, E, coli cells are cultured in the absence of a 
natural amino acid that is to be replaced (e.g., 
phenylalanine) and in the presence of the desired non- 
naturally occurring amino acid(s) (e.g., 2- 

azaphenyl alanine , 3 -azaphenylalanine , 4 -azaphenylalanine , 
or 4- fluorophenyl alanine) . The non-naturally occurring 
amino acid is incorporated into the protein in place of 
its natural counterpart. See, Koide et al . , Biochem. 
33:7470 (1994) . Naturally occurring amino acid residues 
can be converted to non-naturally occurring species by in 
vitro chemical modification. Chemical modification can be 
combined with site-directed mutagenesis to further expand 
the range of substitutions (Wynn and Richards > Protein 

Sci. 2;395 (1993) ) . 

A limited number of non-conservative amino 

acids, amino acids that are not encoded by the genetic 
code, non-naturally occurring amino acids, and unnatural 
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amino acids may be substituted for zCTGF4 amino acid 
residues. 

Multiple amino acid substitutions can be made 
and tested using known methods of mutagenesis and 
5 screening, such as those disclosed by Reidhaar-Olson and 
Sauer (Science 241:53 (1988)) or Bowie and Sauer (Proc. 
Nat'l Acad. Sex. USA 86:2152 (1989)). Briefly, these 
authors disclose methods for simultaneously randomizing 
two or more ppsitions in a polypeptide, selecting for 
10 functional polypeptide, and then sequencing the 
mutagenized polypeptides to determine the spectrum of 
allowable substitutions at each position. Other methods 
M that can be used include phage display {e.g., Lowman et 

O stl., Biochem. 30:10832 (1991), Ladner et al . , U.S. Patent 

15 No. 5,223,409 / Huse, international publication No. WO 
M; 92/06204, and region-directed mutagenesis (Derbyshire et 

JfJ al., Gene 46:145 (1986), and Ner et al., DNA 7:127, 

i:i (1988)). 

s Variants of the disclosed zCTGF4 nucleotide and 

\ : 2 0 polypeptide sequences can also be generated through DNA 
O shuffling as disclosed by Stemmer, Nature 370:389 (1994), 

Si? Stemmer, Proc. Nat'l Acad. Sci. USA 51:10747 (1994), and 

ft, international publication No. WO 97/20078. Briefly, 

variant DNA molecules are generated by in vitro homologous 
25 recombination by random fragmentation of a parent DNA 
followed by reassembly using PCR, resulting in randomly 
introduced point mutations. This technique can be 

modified by using a family of parent DNA molecules, such 
as allelic variants or DNA molecules from different 
3 0 species, to introduce additional variability into the 
process. Selection or screening for the desired activity, 
followed by additional iterations of mutagenesis and assay 
provides for rapid "evolution" of sequences by selecting 
for desirable mutations while simultaneously selecting 
35 against detrimental changes. 



Mutagenesis methods as disclosed herein can be 
combined with high- throughput , automated screening methods 
to detect activity of cloned, mutagenized polypeptides in 
host cells. Mutagenized DNA molecules that encode 
biologically active polypeptides, or polypeptides that 
bind with anti-zCTGF4 antibodies, can be recovered from 
the host cells and rapidly sequenced using modern 
equipment. These methods allow the rapid determination of 
the importance^ of individual amino acid residues in a 
polypeptide of interest, and can be applied to 
polypeptides of unknown structure. 

Essential amino acids in the polypeptides of the 
present invention can be identified according to 
procedures known in the art, such as site-directed 
mutagenesis or alanine -scanning mutagenesis (Cunningham 
and Wells, Science 244:1081 (1989), Bass et al . , Proc. 
Nat '2 Acad. Sci . USA 88:4498 (1991), Coombs and Corey, 
"Site-Directed Mutagenesis and Protein Engineering, " in 
Proteins: Analysis and Design, Angeletti (ed.), pages 259- 
311 (Academic Press, Inc. 1998)). In the latter 

technique, single alanine mutations are introduced at 
every residue in the molecule, and the resultant mutant 
molecules are tested for biological activity as disclosed 
below to identify amino acid residues that are critical to 
the activity of the molecule. See also, Hilton et a2 . , J". 
Bio2. Chem. 272:4699 (1996). The identities of essential 
amino acids can also be inferred from analysis of 
homologies with zCTGF4. 

The location of zCTGF4 receptor binding domains 
can be identified by physical analysis of structure, as 
determined by such techniques as nuclear magnetic 
resonance, crystallography, electron diffraction or 
photoaf f inity labeling, in conjunction with mutation of 
putative contact site amino acids. See, for example, de 
Vos et al., Science 255:306 (1992), Smith et al . , J". Mol . 
Biol. 224:899 (1992), and Wlodaver et al . , FEBS Lett. 
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305:59 (1992) . Moreover, zCTGF4 labeled with biotin or 
FITC can be used for expression cloning of zCTGF4 
receptors . 

The present invention also includes "functional 
5 fragments" of zCTGF4 polypeptides and nucleic acid 
molecules encoding such functional fragments. As 
previously described herein, zCTGF4 is characterized by a 
multi-domain structure comprising a IBP domain (amino acid 
residue 59 (Pro) to 102 (Tyr) of SEQ ID NO: 2) that has 

10 been suggested as an insulin growth factor binding domain, 
a von Willebrand factor c domain (amino acid residue- 114 
(Cys) to 179 (Cys) of SEQ ID NO: 2) , a variable domain 
(amino acid 180 (Ser) to 208 (Lys) , and a sulfated 
glycocon jugate binding motif domain (amino acid residue 

15 209 (Cys) to 252 (Cys) of SEQ ID NO: 2) . Thus, the 
present invention further provides fusion proteins 
encompassing (a) polypeptide molecules comprising one or 
more of the domains described above, and (b) biologically 
active fragments comprising portions of one or more of the 

20 domains . The other polypeptide may be another domains 
from another CTGF, a non-native and/or an unrelated 
secretory signal peptide to facilitate secretion of the 
fusion protein. 

Routine deletion analyses of nucleic acid 
25 molecules can be performed to obtain functional fragments 
of a nucleic acid molecule that encodes an zCTGF4 
polypeptide. As an illustration, DNA molecules having the 
nucleotide sequence of SEQ ID NO:l can be digested with 
Bal31 nuclease to obtain a series of nested deletions. 
3 0 The fragments are then inserted into expression vectors in 
proper reading frame, and the expressed polypeptides are 
isolated and tested for zCTGF4, or for the ability to bind 
anti-zCTGF4 antibodies. One alternative to exonuclease 
digestion is to use oligonucleotide-directed mutagenesis 
35 to introduce deletions or stop codons to specify 
production of a desired fragment. Alternatively, 



37 



particular fragments of an zCTGF4 gene can be synthesized 
using the polymerase chain reaction. 

Standard methods for identifying functional 
domains are well-known to those of skill in the art. For 
5 example, studies on the truncation at either or both 
termini of interferons have been summarized by Horisberger 
and Di Marco, Pharmac. Ther. 66:507 (1995) . Moreover, 
standard techniques for functional analysis of proteins 
are described by, for example, Treuter et al . , Molec. Gen. 

10 Genet. 240:113 (1993), Content et al . , "Expression and 
preliminary deletion analysis of the 42 kDa 2-5A 
synthetase induced by human interferon, " in Biological 
Interferon Systems, Proceedings of ISIR-TNO Meeting on 
Interferon Systems, Cantell (ecL), pages 65-72 (Nijhoff 

15 1987) , Herschman, "The EGF Receptor, 11 in Control of Animal 
Cell Proliferation, Vol. 1, Boynton et al.,, (eds.) pages 
169-199 (Academic Press 1985), Coumailleau et al . , J. 
Biol. Chem. 270:29270 (1995); Fukunaga et al . , J. Biol. 
Chem. 270:25291 (1995); Yamaguchi et al . , Biochem. 

20 Pharmacol. 50:1295 (1995), and Meisel et al . , Plant Molec. 
Biol. 30:1 (1996) . 

The present invention also contemplates 
functional fragments of an zCTGF4 gene that has amino acid 
changes, compared with the amino acid sequence of SEQ ID 

2 5 NO: 2. A variant zCTGF4 gene can be identified on the 

basis of structure by determining the level of identity 
with nucleotide and amino acid sequences of SEQ ID NOs : 1 
and 2, as discussed above. An alternative approach to 
identifying a variant gene on the basis of structure is to 

3 0 determine whether a nucleic acid molecule encoding a 

potential variant zCTGF4 gene can hybridize to a nucleic 
acid molecule having the nucleotide sequence of SEQ ID 
NO:l, as discussed above. 

The present invention also provides polypeptide 
3 5 fragments or peptides comprising an epitope -bearing 
portion of an zCTGF4 polypeptide described herein. Such 
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fragments or peptides may comprise an "immunogenic 
epitope, " which is a part of a protein that elicits an 
antibody response when the entire protein is used as an 
immunogen. Immunogenic epitope -bearing peptides can be 
identified using standard methods (see, for example, 
Geysen et al . , Proc. Nat 1 ! Acad. Sci. USA 81:3998 (1983)). 

In contrast, polypeptide fragments or peptides 
may comprise an "antigenic epitope, " which is a region of 
a protein molecule to which an antibody can specifically 
bind. Certain epitopes consist of a linear or contiguous 
stretch of amino acids, and the antigenicity of such an 
epitope is not disrupted by denaturing agents . It is known 
in the art that relatively short synthetic peptides that 
can mimic epitopes of a protein can be used to stimulate 
the production of antibodies against the protein (see, for 
example, Sutcliffe et al., Science 219:660 (1983)). 
Accordingly, antigenic epitope-bearing peptides and 
polypeptides of the present invention are useful to raise 
antibodies that bind with the polypeptides described 
herein. 

Antigenic epitope-bearing peptides and 
polypeptides preferably contain at least four to ten amino 
acids, at least ten to fifteen amino acids, or about 15 to 
about 30 amino acids of SEQ ID NO: 2. Such epitope-bearing 
peptides and polypeptides can be produced by fragmenting 
an zCTGF4 polypeptide, or by chemical peptide synthesis, 
as described herein. Moreover, epitopes can be selected 
by phage display of random peptide libraries (see, for 
example, Lane and Stephen, Curr. Opin. Immunol. 5:268 
(1993), and Cortese et al . , Curr. Opin. Biotechnol . 7:616 
(1996) ) . Standard methods for identifying epitopes and 
producing antibodies from small peptides that comprise an 
epitope are described, for example, by Mole r "Epitope 
Mapping," in Methods in Molecular Biology, Vol. 10, Manson 
(ed.), pages 105-116 (The Humana Press, Inc. 1992), Price, 
"Production and Characterization of Synthetic Peptide- 
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Derived Antibodies, " in Monoclonal Antibodies: Production, 
Engineering, and Clinical Application, Ritter and Ladyman 
(eds.), pages 60-84 (Cambridge University Press 1995), and 
Coligan et al. (eds.), Current Protocols in Immunology, 
5 pages 9.3.1 - 9.3.5 and pages 9.4.1 - 9.4.11 (John Wiley & 
Sons 1997) . 

Regardless of the particular nucleotide sequence 
of a variant zCTGF4 gene, the gene encodes a polypeptide 
that is characterized by its proliferative or 

10 differentiating activity, or ability to induce specialized 
cell functions, or by the ability to bind specifically to 
an anti-zCTGF4 antibody. More specifically, variant 
zCTGF4 genes encode polypeptides which exhibit at least 
50% and preferably, greater than 70, 80 or 90%, of the 

15 activity of polypeptide encoded by the human zCTGF4 gene 

described herein. 

For any zCTGF4 polypeptide, including variants 

and fusion proteins, one of ordinary skill in the art can 

readily generate a fully degenerate polynucleotide 

2 0 sequence encoding that variant using the information set 

forth in Tables 1 and 2 above. 

The present invention further provides a variety 
of other polypeptide fusions (and related multimeric 
proteins comprising one or more polypeptide fusions) . For 
25 example, a zCTGF4 polypeptide can be prepared as a fusion 
to a dimerizing protein as disclosed in U.S. Patents Nos. 
5,155,027 and 5,567,584. Preferred dimerizing proteins in 
this regard include immunoglobulin constant region 
domains. Immunoglobulin- ZCTGF4 polypeptide fusions can 

3 0 be expressed in genetically engineered cells (to produce a 

variety of multimeric zCTGF4 analogs) . Auxiliary domains 
can be fused to zCTGF4 polypeptides to target them to 
specific cells, tissues, or macromolecules (e.g., 
collagen) . For example, a zCTGF4 polypeptide or protein 
3 5 could be targeted to a predetermined cell type by fusing a 
ZCTGF4 polypeptide to a ligand that specifically binds to 
a receptor on the surface of the target cell. In this 
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way, polypeptides and proteins can be targeted for 
therapeutic or diagnostic purposes. A zCTGF4 polypeptide 
can be fused to two or more moieties, such as an affinity 
tag for purification and a targeting domain. Polypeptide 
fusions can also comprise one or more cleavage sites, 
particularly between domains. See, Tuan et al . , 

Connective Tissue Research 34:1-9, 1996. 

A Hopp/Woods hydrophilicity profile of the 
zCTGF4 protein sequence as shown in SEQ ID NO: 2 can be 
generated (Hopp et al . , Proc . Natl. Acad. Sci.78 :3824- 
3828, 1981; Hopp, J. Immun . Meth. 88.:1-18, 1986- and 
Triquier et al . , Protein Engineering 11^:153-169, 1998). 
The profile is based on a sliding six-residue window. 
Buried G, S, and T residues and exposed H, Y, and W 
residues were ignored. Hydrophilicity can be used to 
determine regions that have the most antigenic potential. 
For example, in zCTGF4, hydrophilic regions include amino 
acid residues 239-244 of SEQ ID NO: 2, amino acid residues 
105-110 of SEQ ID NO: 2, amino acid residues 172-177 of 
SEQ ID NO: 2, amino acid residues 238-243 of SEQ ID NO: 2, 
and amino acid residues 171-176 of SEQ ID NO: 2. 

Using the methods discussed herein, one of 
ordinary skill in the art can identify and/or prepare a 
variety of polypeptides that are have substantially 
similar sequence identity to residues 1 or 24 to 3 54 of 
SEQ ID NO: 2, or functional fragments and fusions thereof, 
and retain the properties of the wild-type protein such as 
the ability to stimulate proliferation, differentiation or 
induce specialized cell function. 

The polypeptides of the present invention, 
including full-length proteins, fragments thereof and 
fusion proteins, can be produced in genetically engineered 
host cells according to conventional techniques. Suitable 
host cells are those cell types that can be transformed or 
transfected with exogenous DNA and grown in culture, and 
include bacteria; fungal cells, and cultured higher 
eukaryotic cells. Eukaryotic cells, particularly cultured 
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cells of multicellular organisms, are preferred. 
Techniques for manipulating cloned DNA molecules and 
introducing exogenous DNA into a variety of host cells are 
disclosed by Sambrook et al . , Molecular Cloning: A 
5 Laboratory Manual , 2nd ed., Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, NY, 1989, and Ausubel et al . , 
eds., Current Protocols in Molecular Biology , John Wiley 
and Sons, Inc., NY, 1987. 

In general, a DNA sequence encoding a ZCTGF4 
10 polypeptide is operably linked to other genetic elements 
required for its expression, generally including a 
transcription promoter and terminator, within • an 
expression vector. The vector will also commonly contain 
}», one or more selectable markers and one or more origins of 

O 15 replication, although those skilled in the art will 
i ; recognize that within certain systems selectable markers 

fxj may be provided on separate vectors, and replication of 

*tj the exogenous DNA may be provided by integration into the 

s host cell genome. Selection of promoters, terminators, 

2 0 selectable markers, vectors and other elements is a matter 
f % of routine design within the level of ordinary skill in 

111 the art. Many such elements are described in the 

literature and are available through commercial suppliers. 

To direct a zCTGF4 polypeptide into the 
25 secretory pathway of a host cell, a secretory signal 
sequence (also known as a leader sequence, prepro sequence 
or pre sequence) is provided in the expression vector. 
The secretory signal sequence may be that of the zCTGF4 
polypeptide, or may be derived from another secreted 
30 protein (e.g., t-PA) or synthesized de novo. The 
secretory signal sequence is operably linked to the zCTGF4 
DNA sequence, i.e., the two sequences are joined in the 
correct reading frame and positioned to direct the newly 
synthesized polypeptide into the secretory pathway of the 
35 host cell. Secretory signal sequences are commonly 

positioned 5 f to the DNA sequence encoding the polypeptide 



42 



of interest, although certain secretory signal sequences 
may be positioned elsewhere in the DNA sequence of 
interest (see, e.g., Welch et al . , U.S. Patent No. 
5,037,743; Holland et al . , U.S. Patent No. 5,143,830). 
5 Alternatively, the secretory signal sequence 

contained in the polypeptides of the present invention is 
used to direct other polypeptides into the secretory 
pathway. The present invention provides for such fusion 
polypeptides. A signal fusion polypeptide can be made 

10 wherein a secretory signal sequence derived from amino 
acid residue 1 to 23 of SEQ ID NO: 2 is operably linked to 
a DNA sequence encoding another polypeptide using methods 
known in the art and disclosed herein. The secretory 
signal sequence contained in the fusion polypeptides of 

15 the present invention is preferably fused amino -terminally 
to an additional peptide to direct the additional peptide 
into the secretory pathway. Such constructs have numerous 
applications known in the art. For example, these novel 
secretory signal sequence fusion constructs can direct the 

2 0 secretion of an active component of a normally non- 
secreted protein. Such fusions may be used in vivo or in 

vitro to direct peptides through the secretory pathway. 

Cultured mammalian cells are suitable hosts 
within the present invention. Methods for introducing 

25 exogenous DNA into mammalian host cells include calcium 
phosphate -mediated transfection (Wigler et al . , Cell 
14:725, 1978; Corsaro and Pearson, Somatic Cell Genetics 
7.: 603, 1981: Graham and Van der Eb, Virology 52:456, 
1973), electroporation (Neumann et al . , EMBO J. l.:841-845, 

30 1982), DEAE-dextran mediated transfection (Ausubel et al . , 
ibid . ) , and liposome-mediated transfection (Hawley-Nelson 
et al., Focus 15:73, 1993; Ciccarone et al . , Focus 15:80, 
1993) , and viral vectors (Miller and Rosman, BioTechniques 
7:980-90, 1989; Wang and Finer, Nature Med. 2:714-6, 

35 1996) . ■ The production of recombinant polypeptides in 
cultured mammalian cells is disclosed, for example, by 
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Le Vinson et al., U.S. Patent No. 4,713,339; Hagen et al . , 
U.S. Patent No. 4,784,950; Palmiter et al . , U.S. Patent 
No. 4,579,821; and Ringold, U.S. Patent No. 4,656,134. 
Suitable cultured mammalian cells include the COS-1 (ATCC 
5 No. CRL 1650), COS-7 (ATCC No. CRL 1651), BHK (ATCC No. 
CRL 1632), BHK 570 (ATCC No. CRL 10314), 293 (ATCC No. CRL 
1573; Graham et al . , J. Gen. Virol . 36:59-72, 1977) and 

Chinese hamster ovary (e.g. CHO-K1; ATCC No. CCL 61) cell 
lines. Additional suitable cell lines are known in the 
10 art and available from public depositories such as the 
American Type Culture Collection, Manassas, VA. In 
general, strong transcription promoters are preferred, 
L _ ; such as promoters from SV-40 or cytomegalovirus. See, 

O e.g., U.S: Patent No. 4,956,288. Other suitable promoters 

15 include those from metallothionein genes (U.S. Patent Nos . 
y, 4,579,821 and 4,601,978) and the adenovirus major late 

promoter. 

yp Drug selection is generally used to select for 

~ cultured mammalian cells into which foreign DNA has been 

L,..r 

[ 5 2 0 inserted. Such cells are commonly referred to as 
Ct " transf ectants" . Cells that have been cultured in the 

—vr.'r 

~ i 

:ij presence of the selective agent and are able to pass the 

jU gene of interest to their progeny are referred to as 

"stable transf ectants . " A preferred selectable marker is 

25 a gene encoding resistance to the antibiotic neomycin. 
Selection is carried out in the presence of a neomycin- 
type drug, such as G-418 or the like. Selection systems 
can also be used to increase the expression level of the 
gene of interest, a process referred to as 

30 "amplification." Amplification is carried out by 

culturing transf ectants in the presence of a low level of 
the selective agent and then increasing the amount of 
selective agent to select for cells that produce high 
levels of the products of the introduced genes. A 

3 5 preferred amplifiable selectable marker is dihydrof olate 
reductase, which confers resistance to methotrexate. 
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Other drug resistance genes (e.g. hygromycin resistance, 
multi-drug resistance, puromycin acetyltransf erase) can 
also be used. Alternative markers that introduce an 
altered phenotype, such as green fluorescent protein, or 

-5 cell surface proteins such as CD4, CD8 , Class I MHC, 
placental alkaline phosphatase may be used to sort 
transfected cells from untransf ected cells by such means 
as FACS sorting or magnetic bead separation technology. 

Other higher eukaryotic cells can also be used 

10 as hosts, including plant cells, insect cells and avian 
cells. The use of Agrobacterium rhizogenes as a vector 
for expressing genes in plant cells has been reviewed by 
Sinkar et al., J. Biosci . ^Bangalore ) 11:47-58, 1987. 

Transformation of insect cells and production of foreign 

15 polypeptides therein is disclosed by Guarino et al . , U.S. 
Patent No. 5,162,222 and WIPO publication WO 94/06463. 
Insect cells can be infected with recombinant baculovirus, 
commonly derived from Autographs calif ornica nuclear 
polyhedros is virus (AcNPV) . DNA encoding the ZCTGF4 

20 polypeptide is inserted into the baculoviral genome in 
place of the AcNPV polyhedrin gene coding sequence by one 
of two methods. The first is the traditional method of 
homologous DNA recombination between wild- type AcNPV and a 
transfer vector containing the zCTGF4 flanked by AcNPV 

25 sequences. Suitable insect cells, e.g. SF9 cells, are 
infected with wild-type AcNPV and transfected with a 
transfer vector comprising a zCTGF4 polynucleotide 
operably linked to an AcNPV polyhedrin gene promoter, 
terminator, and flanking sequences. See, King, L.A. and 

30 Possee, R.D., The Baculovirus Expression System: A 

Laboratory Guide , London, Chapman & Hall; O'Reilly, D.R. 

et al . , Baculovirus Expression Vectors: A Laboratory 

Manual , New York, Oxford University Press., 1994; and, 
Richardson, C. D., Ed., Baculovirus Expression Protocols. 

35 Methods in Molecular Biology, Totowa, NJ, Humana Press, 
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1995. Natural recombination within an insect cell will 
result in a recombinant baculovirus which contains zCTGF4 
driven by the polyhedrin promoter. Recombinant viral 
stocks are made by methods commonly used in the art . 
5 The second method of making recombinant 

baculovirus utilizes a transposon-based system described 
by Luckow (Luckow, V.A, et al . , J Virol 67:4566-79, 1993). 
This system is sold in the Bac-to-Bac kit (Life 
Technologies, Rockville, MD) . This system utilizes a 
10 transfer vector, pFastBacl™ (Life Technologies) 

containing a Tn7 transposon to move the DNA encoding* the 
zCTGF4 polypeptide into a baculovirus genome maintained in 
E. coli as a large plasmid called a "bacmid. " The 
pFastBacl™ transfer vector utilizes the AcNPV polyhedrin 
15 promoter to drive the expression of the gene of interest, 
in this case zCTGF4 . However, pFastBacl™ can be modified 

to a considerable degree. The polyhedrin promoter can be 
removed and substituted with the baculovirus basic protein 
promoter (also known as Pcor, p6 . 9 or MP promoter) which 

2 0 is expressed earlier in the baculovirus infection, and has 

been shown to be advantageous for expressing secreted 
proteins. See, Hill - Perkins , M.S. and Possee, R.D., J. 
Gen. Virol. 71:971-6, 1990; Bonning, B.C. et al . , J . Gen . 
Virol . 75:1551-6, 1994; and, Chazenbalk, G.D., and 
25 Rapoport, B., J. Biol. Chem. 270 :1543-9, 1995. In such 
transfer vector constructs, a short or long version of the 
basic protein promoter can be used. Moreover, transfer 
vectors can be constructed which replace the native zCTGF4 
secretory signal sequences with secretory signal sequences 

3 0 derived from insect proteins. For example, a secretory 

signal sequence from Ecdysteroid Glucosyltransf erase 
(EGT) , honey bee Melittin (Invitrogen, Carlsbad, CA) , or 
baculovirus gp67 (PharMingen, San Diego, CA) can be used 
in constructs to replace the native zCTGF4 secretory 
35 signal sequence. In addition, transfer vectors can 
include an in-frame fusion with DNA encoding an epitope 



tag at the C- or N-terminus of the expressed zCTGF4 
polypeptide, for example, a Glu-Glu epitope tag 
(Grussenmeyer, T. et al . , Proc . Natl . Acad. Sci . 82:7952- 
4, 1985) . Using a technique known in the art, a transfer 
vector containing zCTGF4 is transformed into E. Coli , and 
screened for bacmids which contain an interrupted lacZ 
gene indicative of recombinant baculovirus. The bacmid 
DNA containing the recombinant baculovirus genome is 
isolated, using common techniques, and used to transfect 

* 

Spodoptera frugiperda cells, e.g. Sf9 cells. Recombinant 

virus that expresses ZCTGF4 is subsequently produced. 
Recombinant* viral stocks are made by methods commonly 'used 
the art . 

The recombinant virus is used to infect host 
cells, typically a cell line derived from the fall 
armyworm, Spodoptera frugiperda. See, in general, Glick 
and Pasternak, Molecular Biotechnology : Principles and 
Applications of Recombinant DNA , ASM Press, Washington, 
D.C., 1994. Another suitable cell line is the High 
FiveO™ cell line (Invitrogen) derived from Trichoplusla 
ni (U.S. Patent No. 5,300,435). Commercially available 
serum- free media are used to grow and maintain the cells. 
Suitable media are Sf900 II™ (Life Technologies) or ESF 

921™ (Expression Systems) for the Sf9 cells; and Ex- 
cellO405™ (JRH Biosciences, Lenexa, KS) or Express FiveO™ 
(Life Technologies) for the T. ni cells. The cells are 
grown up from an inoculation density of approximately 2-5 
x 10 5 cells to a density of 1-2 x 10 6 cells at which time 
a recombinant viral stock is added at a multiplicity of 
infection (MOI) of 0.1 to 10, more typically near 3. The 
recombinant virus -infected cells typically produce the 
recombinant ZCTGF4 polypeptide at 12-72 hours post- 
infection and secrete it with varying efficiency into the 
medium. The culture is usually harvested 4 8 hours post- 
infection. Centrifugation is used to separate the cells 
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from the medium (supernatant) . The supernatant containing 
the z CTGF4 polypeptide is filtered through micropore 
filters, usually 0.45 /xm pore size. Procedures used are 
generally described in available laboratory manuals (King, 
5 L. A. and Possee, R.D., ibid . ; O'Reilly, D.R. et al., 
ibid.; Richardson, C. D., ibid. ) . Subseguent purification 
of the zCTGF4 polypeptide from the supernatant can be 
achieved using methods described herein. 

Fungal cells, including yeast cells, can also be 
10 used within the present invention. Yeast species of 
particular interest in this regard include Saccharomyces 

m 

cerevisiae, Pichia pastoris, and Pichia methanolica. 

Methods for transforming S. cerevisiae cells with 

exogenous DNA and producing recombinant polypeptides 
15 therefrom are disclosed by, for example, Kawasaki, U.S. 
Patent No. 4,599,311; Kawasaki et al . , U.S. Patent No. 
4,931,373; Brake, U.S. Patent No. 4,870,008; Welch et al . , 
U.S. Patent No. 5,037,743; and Murray et al., U.S. Patent 
No. 4,845,075. Transformed cells are selected by 

20 phenotype determined by the selectable marker, commonly 
drug resistance or the ability to grow in the absence of a 
particular nutrient (e.g., leucine). A preferred vector 
system for use in Saccharomyces cerevisiae is the POT1 

vector system disclosed by Kawasaki et al . (U.S. Patent 
25 No. 4,931,373), which allows transformed cells .to be 
selected by growth in glucose- containing media. Suitable 
promoters and terminators for use in yeast include those 
from glycolytic enzyme genes (see, e.g., Kawasaki, U.S. 
Patent No. 4,599,311; Kingsman et al . , U.S. Patent No. 
30 4,615,974; and Bitter, U.S. Patent No. 4,977,092) and 
alcohol dehydrogenase genes. See also U.S. Patents Nos . 
4,990,446; 5,063,154; 5,139,936 and 4,661,454. 

Transformation systems for other yeasts, including 
Hansenula polymorpha , Schizosaccharomyces pombe, 

35 Kluyveromyces lactis, Kluyveromyces fragilis , Ustilago 
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maydis, Pichia pastoris , Pichia methanolica , Pichia 
guillermondii and Candida maltosa are known in the art. 
See, for example, Gleeson et al., J. Gen. Microbiol . 
132 :3453-65, 1986 and Cregg, U.S. Patent No. 4,882,279. 
5 Aspergillus cells may be utilized according to the methods 

of McKnight et al . , U.S. Patent No. 4,935,349. Methods 
for transforming Acremonium chrysogenum are disclosed by 
Sumino et al . , U.S. Patent No. 5,162,228. Methods for 
transforming Neurospora are disclosed by Lambowitz, U.S. 

10 Patent No. 4,486,533. 

The use of Pichia. methanolica as host for the 

production of recombinant proteins is disclosed in WIPO 
P; Publications WO 97/17450, WO 97/17451, WO 98/02536, and WO 

Q 98/02565. DNA molecules for use in transforming P. 

y ; 15 methanolica will commonly be prepared as double -stranded, 

J*;! circular plasmids, which are preferably linearized prior 

yp to transformation. For polypeptide production in P. 

methanolica, it is preferred that the promoter and 

terminator in the plasmid be that of a P. methanolica 

IJ : j 20 gene, such as a P. methanolica alcohol utilization gene 

|T (AUG1 or AUG2) . Other useful promoters include those of 

the dihydroxyacetone synthase (DHAS) , formate 

dehydrogenase (FMD)- , and catalase (CAT) genes. To 
facilitate integration of the DNA into the host 

25 chromosome, it is preferred to have the entire expression 
segment of the plasmid flanked at both ends by host DNA 
sequences. A preferred selectable marker for use in 
Pichia methanolica is a P. methanolica ADE2 gene, which 

encodes phosphoribosyl - 5 -aminoimidazole carboxylase ( AIRC ; 
30 EC 4.1.1.21), which allows ade2 host cells to grow in the 

absence of adenine. For large-scale, industrial processes 
where it is desirable to minimize the use of methanol, it 
is preferred to use host cells in which both methanol 
utilization genes {AUG1 and AUG2) are deleted. For 
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production of secreted proteins, host cells deficient in 
vacuolar protease genes (PEP4 and PRB1) are preferred. 

Electroporation is used to facilitate the introduction of 
a plasmid containing DNA encoding a polypeptide of 
5 interest into P. methanolica cells. It is preferred to 

transform P. methanolica cells by electroporation using 

an exponentially decaying, pulsed electric field having a 
field strength of from 2.5 to 4.5 kV/cm, preferably about 

3.75 kV/cm, arid a time constant (x) of from 1 to 40 

10 milliseconds, most preferably about 20 milliseconds. 

Prokaryotic host cells, including strains of the 
bacteria Escherichia coli, Bacillus and other genera are 
also useful host cells within the present invention. 
Techniques for transforming these hosts and expressing 

15 foreign DNA sequences cloned therein are well known in the 
art (see, e.g., Sambrook et al . , ibid.}. When expressing 
a zCTGF4 polypeptide in bacteria such as E. coli, the 
polypeptide may be retained in the cytoplasm, typically as 
insoluble granules, or may be directed to the periplasmic 

2 0 space by a bacterial secretion sequence. In the former 

case, the cells are lysed, and the granules are recovered 
and denatured using, for example, guanidine isothiocyanate 
or urea. The denatured polypeptide can then be refolded 
and dimerized by diluting the denaturant, such as by 

25 dialysis against a solution of urea and a combination of 
reduced and oxidized glutathione, followed by dialysis 
against a buffered saline solution. In the latter case, 
the polypeptide can be recovered from the periplasmic 
space in a soluble and functional form by disrupting the 

30 cells (by, for example, sonication or osmotic shock) to 
release the contents of the periplasmic space and 
recovering the protein, thereby obviating the need for 
denaturation and refolding. 

Transformed or transfected host cells are 

3 5 cultured according to conventional procedures in a culture 

medium containing nutrients and other components required 
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71 (Toso Haas) and the like. Suitable solid supports 
include glass beads, silica-based resins, cellulosic 
resins, agarose beads, cross -linked agarose beads, 
polystyrene beads, cross -linked polyacrylamide resins and 
5 the like that are insoluble under the conditions in which 
they are to be used. These supports may be modified with 
reactive groups that allow attachment of proteins by amino 
groups, carboxyl groups, sulfhydryl groups, hydroxyl 
groups and/or carbohydrate moieties. Examples of coupling 

10 chemistries include cyanogen bromide activation, N- 
hydroxysuccinimide activation, epoxide activation, 
sulfhydryl activation, hydrazide activation, and carboxyl 
and amino derivatives for carbodiimide . coupling 
chemistries . These and other solid media are well known 

15 and widely used in the art, and are available from 
commercial suppliers. Methods for binding receptor 
polypeptides to support media are well known in the art. 
Selection of a particular method is a matter of routine 
design and is determined in part by the properties of the 

20 chosen support. See, for example, Affinity 

Chromatography : Principles & Methods , Pharmacia LKB 

Biotechnology, Uppsala, Sweden, 1988. 

The polypeptides of the present invention can be 
isolated by exploitation of size, charge and 

25 hydrophobicity . For example, immobilized metal ion 

adsorption (IMAC) chromatography can be used to purify 
histidine-rich proteins (E. Sulkowski, Trends in Biochem. 
3:1-7, 1985). Other methods of purification include 
purification of glycosylated proteins by lectin affinity 

30 chromatography and ion exchange chromatography ( Methods in 
Enzymol . , Vol. 182, "Guide to Protein Purification", M. 
Deutscher, (ed.), Acad. Press, San Diego, 1990, pp. 529- 
3 9) . Within additional embodiments of the invention, a 
fusion of the polypeptide of interest and an affinity tag 

35 (e.g., maltose-binding protein, an immunoglobulin domain) 
may be constructed to facilitate purification. ZCTGF4 has 
a domain homologous to the heparin binding domain 
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described previously for CTGF, and exploitation of this 
property may be useful for purification of zCTGF4 . For a 
review, see, Burgess et al . , Ann. Rev, of Biochem. 58 : 575- 
606, 1989. Members of the FGF family, which also have a 
5 heparin binding domain can be purified to apparent 
homogeneity by heparin- Sepharose affinity chromatography 
(Gospodarowicz et al . , Proc. Natl. Acad. Sci . 81:6963- 
6967, 1984) and eluted using linear step gradients of NaCl 
(Ron et al., , J. Biol. Chem. 268 (4) :2984-2988 , 1993; 

10 Chromatography : Principles & Methods , pp. 77-80, 

Pharmacia LKB Biotechnology, Uppsala, Sweden, 1993; in 
"Immobilized Affinity Ligand Techniques" , Hermanson et 
al . , eds., pp. 165-167, Academic Press, San Diego, 1992; 
Kjellen et al . , Ann. Rev. Biochem.Ann. Rev. Biochem. 

15 60:443-474, 1991; and Ke et al . , Protein Expr. Purif. 
3 (6) :497-507, 1992.) 

Protein refolding (and optionally reoxidation) 
procedures may be advantageously used. It is preferred to 
purify the protein to >80% purity, more preferably to >90% 

20 purity, even more preferably >95%, and particularly 
preferred is a pharmaceutically pure state, that is 
greater than 99.9% pure with respect to contaminating 
macromolecules, particularly other proteins and nucleic 
acids, and free of infectious and pyrogenic agents. 

25 Preferably, a purified protein is substantially free of 
other proteins, particularly other proteins of animal 
origin. 

ZCTGF4 polypeptides or fragments thereof may 
also be prepared through chemical synthesis (Merrifield, 

30 J. Am. Chem. Soc . 85,: 2149, 1963} . ZCTGF4 polypeptides may 
be monomers or multimers; glycosylated or non- 
glycosylated; . pegylated or non-pegylated; and may or may 
not include an initial methionine amino acid residue. 

The activity of molecules of the present 

3 5 invention can be measured using a variety of assays that 
measure cell proliferation, differentiation, chemotaxis or 
induction of specialized cell functions. Of particular 
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interest are changes in proliferation or differentiation 
of endothelial cells, particularly endothelial cells 
isolated from testis, trachea, bone marrow or kidney 
tissue. Proliferation and differentiation can be measured 
5 using in vitro using cultured cells or in vivo by 

administering molecules of the claimed invention to the 
appropriate animal model. Assays measuring cell 

proliferation or differentiation are well known in the 
art. For example, assays measuring proliferation include 

10 such assays as chemosensitivity to neutral red dye 
(Cavanaugh et al . , Investigational New Drugs 8: 347-354, 
1990, incorporated herein by reference) , incorporation of 
radiolabelled nucleotides (Cook et al., Analytical 
Biochem. 3/79:1-7, 1989, incorporated herein by reference) , 

15 incorporation of 5-bromo-2 1 -deoxyuridine (BrdU) in the DNA 
of proliferating cells (Porstmann et al . , J. Immunol. 
Methods 82.: 169-179, 1985, incorporated herein by 
reference) , and use of tetrazolium salts (Mosmann, J. 
Immunol . Methods 6_5:55-63, 1983; Alley et al . , Cancer Res . 

20 48:589-601, 1988; Marshall et al . , Growth Reg . j5:69-84, 
1995; and Scudiero et al . , Cancer Res. 48:4827-4833, 1988; 
all incorporated herein by reference) . Assays measuring 
differentiation include, for example, measuring cell- 
surface markers associated with stage-specific expression 

25 of a tissue, enzymatic activity, functional activity or 
morphological changes (Watt, FASEB, 5:281-284, 1991; 
Francis, Differentiation J57: 63-75, 1994; Raes, Adv . Anim . 
Cell Biol. Techno! . Bioprocesses , 161-171, 1989; all 
incorporated herein by reference) . 

3 0 Examples of assays measuring induction of 

specialized cell functions include: extracellular matrix 
protein mRNA induction assays (Frazier et al . , J. Invest. 
Dermatol . 107 :404-411, 1996) ; 35 S methionine pulse-chase 
assays measuring stimulation of matrix protein synthesis 

35 (Frazier et al . , ibid., 1996); subcutaneous administration 
of growth factors to mice {Roberts et al . , Proc. Natl. 



54 



Acad. Sci. USA 83:4167-4171, 1986); and in situ 
hybridization to measure changes in mRNA expression (Fava 
et al. , Blood 76:1946-1955, 1990). 

An exemplary in vivo assay is when mammalian 
5 transfected (or co- transfected) expression host cells may 
be embedded in an alginate environment and injected 
(implanted) into recipient animals. Alginate-poly-L- 
lysine microencapsulation, permselective membrane 
encapsulation and diffusion chambers have been described 

10 as a means to entrap transfected mammalian cells or 
primary mammalian cells. These types of non- immunogenic 
"encapsulations" or microenvironments permit the transfer 
of nutrients into the microenvironment , and also permit 
the diffusion of proteins and other macromolecules 

15 secreted or released by the captured cells across the 
environmental barrier to the recipient animal. Most 
importantly, the capsules or microenvironments mask and 
shield the foreign, embedded cells from the recipient 
animal f s immune response. Such microenvironments can 

20 extend the life of the injected cells from a few hours or 
days (naked cells) to several weeks (embedded cells) . 

An in vivo approach for assaying proteins of the 
present invention involves viral delivery systems. 
Exemplary viruses for this purpose include adenovirus, 

25 herpesvirus, retroviruses, vaccinia virus, and adeno- 
associated virus (AAV) . Adenovirus, a double-stranded DNA 
virus, is currently the best studied gene transfer vector 
for delivery of heterologous nucleic acid {for review, see 
Becker et al., Meth. Cell Biol. 43:161-89, 1994; and 

30 Douglas and Curiel, Science & Medicine £:44-53, 1997). 
The adenovirus system offers several advantages: (i) 
adenovirus can accommodate relatively large DNA inserts; 
(ii) can be grown to high-titer; (iii) infect a broad 
range of mammalian cell types; and (iv) can be used with 

35 many different promoters including ubiquitous, tissue 
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specific, and regulatable promoters. Also, because 
adenoviruses are stable in the bloodstream, they can be 
administered by intravenous injection. 

Using adenovirus vectors where portions of the 
5 adenovirus genome are deleted, inserts are incorporated 
into the viral DNA by direct ligation or by homologous 
recombination with a co-transf ected plasmid. In an 
exemplary system, the essential El gene has been deleted 
from the viral vector, and the virus will not replicate 
10 unless the El gene is provided by the host cell (the human 
2 93 cell line is exemplary) . When intravenously 

administered to intact animals, adenovirus primarily 
targets the liver. If the adenoviral delivery system has 
an - El gene deletion, the virus cannot replicate in the 



ro is host cells. However, the host's tissue (e.g., liver) will 

If! 

J: express and process (and, if a secretory signal sequence 

f is present, secrete) the heterologous protein. Secreted 

proteins will enter the circulation in the highly 
vascularized liver, and effects on the infected animal can 
20 be determined. 

Moreover, adenoviral vectors containing various 
deletions of viral genes can be used in an attempt to 
reduce or eliminate immune responses to the vector. Such 
adenoviruses are El deleted, and in addition contain 
25 deletions of E2A or E4 (Lusky, M. et al., J» Virol. 
72. :2022-2032, 1998; Raper, S.E. et al . , Human Gene Therapy 
9:671-679, 1998). In addition, deletion of E2b is 
reported to reduce immune responses (Amalfitano, A. et 
al., J. Virol . 72:926-933, 1998). Moreover, by deleting 
30 the entire adenovirus genome, very large inserts of 
heterologous DNA can be accommodated. Generation of so 
called "gutless" adenoviruses where all viral genes are 
deleted are particularly advantageous for insertion of 
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large inserts of heterologous DNA. For review, see Yeh, 
P. and Perricaudet, M., FASEB J. 11 :615-623, 1997. 

The adenovirus system can also be used for 
protein production in vitro. By culturing adenovirus- 
5 infected non-293 cells under conditions where the cells 
are not rapidly dividing, the cells can produce proteins 
for extended periods of time. For instance, BHK cells are 
grown to confluence in cell factories, then exposed to the 
adenoviral vector encoding the secreted protein of 

10 interest . The cells are then grown under serum-free 
conditions, which allows infected cells to survive, for 
several weeks without significant cell division. 
Alternatively, adenovirus vector infected 293 cells can be 
grown as adherent cells or in suspension culture at 

15 relatively high cell density to produce significant 
amounts of protein (See Garnier et al * , Cytotechnol 
15^:145-155, 1994). With either protocol, an expressed, 
secreted heterologous protein can be repeatedly isolated 
from the cell culture supernatant, lysate, or membrane 

20 fractions depending on the disposition of the expressed 
protein in the cell* Within the infected 293 cell 
production protocol, non-secreted proteins may also be 
effectively obtained. 

Assays can be used to measure other cellular 

25 responses, that include, chemotaxis, adhesion, changes in 
ion channel influx, regulation of second messenger levels 
and neurotransmitter release. Such assays are well known 
in the art. See, for example, in "Basic & Clinical 
Endocrinology Ser. , Vol. Vol. 3," Cytochemical Bioassavs : 

30 Techniques & Applications , Chayen; Chayen, Bitensky, eds., 
Dekker, New York, 1983. 

In view of the tissue distribution observed for 
this zCTGF4 , agonists (including the natural ligand) and 
antagonists have enormous potential in both in vitro and 

35 in vivo applications. Compounds identified as zCTGF4 
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agonists are useful for stimulating proliferation and/or 
differentiation of cells in culture. For example, agonist 
compounds are useful as components of defined cell culture 
media, and may be used alone or in combination with other 
5 cytokines and hormones to replace serum that is commonly 
used in cell culture. Agonists are thus useful in 
specifically promoting the growth and/or development of 
cells derived from testis, trachea, bone marrow or kidney 
tissues or endothelial and ligament -derived fibroblast 

10 cells in culture. 

Agonists (including zCTGF4) will be useful- for 
increasing production of extracellular matrix components, 
and may be used in the treatment of connective tissue. 
Particularly, agonists will be useful as treatment for 

15 ligaments, cartilage and tendons. By virtue of the tissue 
distribution for the expression of these molecules, 
agonists will be useful for enhancing healing and 
stabilizing wounds and as a component artificial skin. 
The presence of zCTGF4 expression in the bone marrow 

20 suggests that the molecules of the present invention play 
a role in hematopoiesis . That role is likely indirect, 
where stromal cells within the architecture of the bone 
marrow secrete zCTGF4, modulating production of cells of 
the hematopoietic lineage. 

25 Antagonists will be useful for inhibiting 

expression of specialized cell functions, such as 
production of extracellular components and inhibition of 
cell proliferation. Genes encoding polypeptides having 
potential ZCTGF4 polypeptide binding domains can be 

3 0 obtained by screening random peptide libraries displayed 
on phage (phage display)- or on bacteria, such as E. coll. 

Nucleotide sequences encoding the polypeptides can be 
obtained in a number of ways, such as through random 
mutagenesis and random polynucleotide synthesis. These 
35 random peptide display libraries can be used to screen for 
peptides which interact with a known target which can be a 
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protein or polypeptide, such as a ligand or receptor, a 
biological or synthetic macromolecule , or organic or 
inorganic substances. Techniques for creating and 

screening such random peptide display libraries are known 
5 in the art (Ladner et al . , U.S. Patent 5,223,409; Ladner 
et al., U.S. Patent 4,946,778; Ladner et al., U.S. Patent 
5,403,484 and Ladner et al., U.S. Patent 5,571,698) and 
random peptide display libraries and kits for screening 
such libraries are available commercially, for instance 

10 from Clontech (Palo Alto, CA) , Invitrogen Inc. (San Diego, 
CA) , New England Biolabs, Inc. (Beverly, MA) and Pharmacia 
LKB Biotechnology Inc. (Piscataway, NJ) . Random peptide 
display libraries can be screened using the ZCTGF4 
sequences disclosed herein to identify proteins which bind 

15 to zCTGF4 . These "binding proteins" which interact with 
zCTGF4 polypeptides may be used for tagging cells; for 
isolating homolog polypeptides by affinity purification; 
they can be directly or indirectly conjugated to drugs, 
toxins, radionuclides and the like. These binding 

20 proteins can also be used in analytical methods such as 
for screening expression libraries and neutralizing 
activity. The binding proteins can also be used for 

diagnostic assays for determining circulating levels of 
polypeptides; for detecting or quantitating soluble 

25 polypeptides as marker of underlying pathology or disease. 
These binding proteins can also act as zCTGF4 
"antagonists" to block zCTGF4 binding and signal 
transduction in vitro and in vivo. These anti- zCTGF4 
binding proteins would be useful for inhibiting expression 

30 of genes which result in proliferation, differentiation or 
induction of specialized cell functions, such as 
production of excellular matrix. Such anti-zCTGF4 binding 
proteins can be used for treatment in bone marrow 
fibrosis, modulating production or differentiation of 

35 hematopoietic cells, prevention of scar tissue formation, 
cutaneous lupus erythematosis , scleroderma, dermatositis , 
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and end-stage kidney failure, alone or combination with 
other therapies. 

zCTGF4 can also be used to identify inhibitors 
(antagonists) of its activity. Test compounds are added 
5 to the assays disclosed herein to identify compounds that 
inhibit the activity of zCTGF4 . In addition to those 
assays disclosed herein, samples can be tested for 
inhibition of zCTGF4 activity within a variety of assays 
designed to measure receptor binding or the 

10 stimulation/inhibition of zCTGF4 -dependent cellular 
responses. For example, ZCTGF4- responsive cell lines* can 
be transfected with a reporter gene construct that is 
responsive to a ZCTGF4- stimulated cellular pathway. 
Reporter gene constructs of this type are known in the 

15 art, and will generally comprise a zCTGF4 -DNA response 
element operably linked to a gene encoding an assayable 
protein, such as lucif erase. DNA response elements can 
include, but are not limited to, cyclic AMP response 
elements (CRE) , hormone response elements (HRE) insulin 

20 response element (IRE) (Nasrin et al . , Proc. Natl. Acad. 
Sci . USA .87:5273-7, 1990) and serum response elements 
(SRE) (Shaw et . al . Cell 56: 563-72, 1989) . Cyclic AMP 
response elements are reviewed in Roestler et al . , J. 
Biol. Chem. 263 (19) : 9063-6; 1988 and Habener, Molec. 

25 Endocrinol . 4 (8) : 1087-94; 1990. Hormone response 

elements are reviewed in Beato, Cell 5_6:335~44; 1989. 
Candidate compounds, solutions, mixtures or extracts are 
tested for the ability to inhibit the activity of zCTGF4 
on the target cells as evidenced by a decrease in zCTGF4 

3 0 stimulation of reporter gene expression. Assays of this 
type will detect compounds that directly block zCTGF4 
binding to cell -surface receptors, as well as compounds 
that block processes in the cellular pathway subsequent to 
receptor- ligand binding. In the alternative, compounds or 

3 5 other samples can be tested for direct blocking of zCTGF4 
binding to receptor using zCTGF4 tagged with a detectable 
label (e.g., 125 I, biotin, horseradish peroxidase, FITC, 
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and the like) . Within assays of this type, the ability of 
a test sample to inhibit the binding of labeled zCTGF4 to 
the receptor is indicative of inhibitory activity, which 
can be confirmed through secondary assays. Receptors used 
within binding assays may be cellular receptors or 
isolated, immobilized receptors. 

A ZCTGF4 polypeptide can be expressed as a 
fusion with an immunoglobulin heavy chain constant region, 
typically an F c fragment, which contains two constant 

region domains and lacks the variable region. Methods for 
preparing such fusions are disclosed in U.S. Patents Nos. 
5,155,027 and 5,567,584. Such fusions are typically 
secreted as multimeric molecules wherein the Fc portions 
are disulfide bonded to each other and two non-Ig 
polypeptides are arrayed in closed proximity to each 
other. Fusions of this type can be used to affinity 
purify ligand, as an in vitro assay tool, or antagonist. 
For use in assays, the chimeras are bound to a support via 
the F c region and used in an ELISA format . 

A zCTGF4 ligand-binding polypeptide can also be 
used for purification of ligand. The polypeptide is 
immobilized on a solid support, such as beads of agarose, 
cross-linked agarose, glass, cellulosic resins, silica- 
based resins, polystyrene, cross-linked polyacrylamide, or 
like materials that are stable under the conditions of 
use. Methods for linking polypeptides to solid supports 
are known in the art, and include amine chemistry, 
cyanogen bromide activation, N-hydroxysuccinimide 
activation, epoxide activation, sulfhydryl activation, and 
hydrazide activation. The resulting medium will generally 
be configured in the form of a column, and fluids 
containing ligand are passed through the column one or 
more times to allow ligand to bind to the receptor 
polypeptide. The ligand is then eluted using changes in 
salt concentration, chaotropic agents (guanidine HCl) , or 
pH to disrupt ligand-receptor binding. 
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An assay system that uses a ligand- binding 
receptor (or an antibody, one member of a complement/ 
ant i- complement pair) or a binding fragment thereof, and a 
commercially available biosensor instrument (BIAcore, 
5 Pharmacia Biosensor, Piscataway, NJ) may be advantageously 
employed. Such receptor, antibody, member of a 

complement /ant i- complement pair or fragment is immobilized 
onto the surface of a receptor chip. Use of this 
instrument is disclosed by Karlsson, J. Immunol. Methods 

10 145 :229-40, 1991 and Cunningham and Wells, J. Mol , Biol. 
234 :554-63, 1993. A receptor, antibody, member" or 
fragment is covalently attached, using amine or sulfhydryi 
chemistry, to dextran fibers that are attached to gold 
film within the flow cell. A test sample is passed 

15 through the cell. If a ligand, epitope, or opposite 
member of the complement /anti- complement pair is present 
in the sample, it will bind to the immobilized receptor, 
antibody or member, respectively, causing a change in the 
refractive index of the medium, which is detected as a 

20 change in surface plasmon resonance of the gold film. 
This system allows the determination of on- and off -rates, 
from which binding affinity can be calculated, and 
assessment of stoichiometry of binding. 

Ligand -binding receptor polypeptides can also be 

25 used within other assay systems known in the art. Such 
systems include Scatchard analysis for determination of 
binding affinity (see Scatchard, Ann. NY Acad. Sci . 51 : 
660-72, 1949) and calorimetric assays (Cunningham et al . , 
Science 253 :545-48, 1991; Cunningham et al . , Science 

30 245 :821-25, 1991). 

ZCTGF4 polypeptides can also be used to prepare 
antibodies that bind to zCTGF4 epitopes, peptides or 
polypeptides. The zCTGF4 polypeptide or a fragment thereof 
serves as an antigen (immunogen) to inoculate an animal 

35 and elicit an immune response. One of skill in the art 
would recognize that antigenic, epitope-bearing 
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polypeptides contain a sequence of at least 6, preferably 
at least 9, and more preferably at least 15 to about 30 
contiguous amino acid residues of a zCTGF4 polypeptide 
(e.g., SEQ ID NO: 2). Polypeptides comprising a larger 
5 portion of a ZCTGF4 polypeptide, i.e., from 30 to 10 
residues up to the entire length of the amino acid 
sequence are included. Antigens or immunogenic epitopes 
can also include attached tags, adjuvants and carriers, as 
described herein. Suitable antigens include the zCTGF4 

10 polypeptide encoded by SEQ ID NO: 2 from amino acid number 
24 to amino acid number 354, or a contiguous 9 to- 330 
amino acid fragment thereof. Other suitable antigens 
include the IBP domain (residues 59-102) , von Willebrand 
factor c domain (residues 114-179) , variable domain 

15 (residues 180-208) , and sulfated glyconcon jugate binding 
motif domain (residues 209-252) as disclosed herein. 
Preferred peptides to use as antigens are hydrophilic 
peptides such as those predicted by one of skill in the 
art from a hydrophobicity plot (See Figure) . ZCTGF4 

2 0 hydrophilic peptides include peptides comprising amino 
acid sequences selected from the group consisting of: 
residues 239-244, residues 105-110, residues 172-177, 
residues 238-243 and residues 171-176, all of SEQ ID NO: 2. 
Antibodies from an immune response generated by 

25 inoculation of an animal with these antigens can be 
isolated and purified as described herein. Methods for 
preparing and isolating polyclonal and monoclonal 
antibodies are well known in the art. See, for example, 
Current Protocols in Immunology , Cooligan, et al. (eds.), 

30 National Institutes of Health, John Wiley and Sons, Inc., 
1995; Sambrook et al., Molecular Cloning : A Laboratory 
Manual , Second Edition , Cold Spring Harbor, NY, 1989; and 
Hurrell, J. G. R., Ed., Monoclonal Hybridoma Antibodies: 
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Techniques and Applications , CRC Press, Inc., Boca Raton, 
FL, 1982. 

As would be evident to one of ordinary skill in 
the art, polyclonal antibodies can be generated from 
5 inoculating a variety of warm-blooded animals such as 
horses, cows, goats, sheep, dogs, chickens, rabbits, mice, 
and rats with a zCTGF4 polypeptide or a fragment thereof. 

The immunogenicity of a zCTGF4 polypeptide may 
be increased through the use of an adjuvant, such as alum 

10 (aluminum hydroxide) or Freund's complete or incomplete 
adjuvant. Polypeptides useful for immunization -also 
include fusion polypeptides, such as fusions of zCTGF4 or 
a portion thereof with an immunoglobulin polypeptide or 
with maltose binding protein. The polypeptide immunogen 

15 may be a full-length molecule or a portion thereof. If 
the polypeptide portion is "hapten-like", such portion may 
be advantageously joined or linked to a macromolecular 
carrier (such as keyhole limpet hemocyanin (KLH) , bovine 
serum albumin (BSA) or tetanus toxoid) for immunization. 

20 As used herein, the term "antibodies" includes 

polyclonal antibodies, affinity-purified polyclonal 
antibodies, monoclonal antibodies, and antigen-binding 
fragments, such as F(ab r )2 and Fab proteolytic fragments. 

Genetically engineered intact antibodies or fragments, 
25 such as chimeric antibodies, Fv fragments, single chain 
antibodies and the like, as well as synthetic antigen- 
binding peptides and polypeptides, are also included. 
Non-human antibodies may be humanized by grafting non- 
human CDRs onto human framework and constant regions, or 
30 by incorporating the entire non-human variable domains 
(optionally "cloaking" them with a human-like surface by 
replacement of exposed residues, wherein the result is a 
"veneered" antibody). In some instances, humanized 

antibodies may retain non-human residues within the human 
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variable region framework domains to enhance proper 
binding characteristics. Through humanizing antibodies , 
biological half-life may be increased, and the potential 
for adverse immune reactions upon administration to humans 
5 is reduced. Moreover, human antibodies can be produced in 
transgenic, non-human animals that have been engineered to 
contain human immunoglobulin genes as disclosed in WIPO 
Publication WO 98/24893. It is preferred that the 
endogenous immunoglobulin genes in these animals be 

10 inactivated or eliminated, such as by homologous 
recombination. 

Antibodies are considered to be specifically 
binding if: 1) they exhibit a threshold level of binding 
activity, and 2) they do not significantly cross-react 

15 with related polypeptide molecules. A threshold level of 
binding is determined if anti- zCTGF4 antibodies herein 
bind to a zCTGF4 polypeptide, peptide or epitope with an 
affinity at least 10-fold greater than the binding 
affinity to control (non- zCTGF4) polypeptide. It is 

20 preferred that the antibodies exhibit a binding affinity 

6—1 7—1 
(K a ) of 10 M or greater, preferably 10 M or greater, 

8 — 1 

more preferably 10 M or greater, and most preferably 

10 M or greater. The binding affinity of an antibody 
can be readily determined by one of ordinary skill in the 

25 art, for example, by Scatchard analysis (Scatchard, G., 
Ann. NY Acad. Sci . 51 : 660-672, 1949) . 

Whether anti-zCTGF4 antibodies do not 
significantly cross-react with related polypeptide 
molecules is shown, for example, by the antibody detecting 

30 zCTGF4 polypeptide but not known related polypeptides 
using a standard Western blot analysis (Ausubel et al., 
ibid.). Examples of known related polypeptides are those 
disclosed in the prior art, such as known orthologs, and 
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paralogs, and similar known members of a protein family, 
such as other known human CTGFs (e.g., CTGF and CTGF-2) . 
Screening can also be done using non-human zCTGF4, and 
2CTGF4 mutant polypeptides. Moreover, antibodies can be 
5 "screened against 7 ' known related polypeptides , to isolate 
a population that specifically binds to the zCTGF4 
polypeptides. For example, antibodies raised to zCTGF4 
are adsorbed to related polypeptides adhered to insoluble 
matrix; antibodies specific to zCTGF4 will flow through 

10 the matrix under the proper buffer conditions. Screening 
allows isolation of polyclonal and monoclonal antibodies 
non-crossreactive to known closely related polypeptides 
( Antibodies: A Laboratory Manual , Harlow and Lane (eds.), 
Cold Spring Harbor Laboratory Press, 1988; Current 

15 Protocols in Immunology , Cooligan, et al. (eds.), National 
Institutes of Health, John Wiley and Sons, Inc., 1995). 
Screening and isolation of specific antibodies is well 
known in the art. See, Fundamental Immunology , Paul 
(eds.), Raven Press, 1993; Getzoff et al., Adv. In 

20 Immunol. 43>: 1-98, 1988; Monoclonal Antibodies : 
Principles and Practice , Goding, J.W. (eds.), Academic 
Press Ltd. , 1996; Benjamin et al . , Ann . Rev . Immunol . 2: 
67-101, 1984. Specifically binding anti-zCTGF4 antibodies 
can be detected by a number of methods in the art, and 

25 disclosed below. 

A variety of assays known to those skilled in 
the art can be utilized to detect antibodies which bind to 
zCTGF4 proteins or polypeptides. Exemplary assays are 
described in detail in Antibodies: A Laboratory Manual , 

30 Harlow and Lane (Eds.), Cold Spring Harbor Laboratory 
Press, 1988. Representative examples of such assays 
include : concurrent Immunoelectrophoresis , 

radioimmunoassay, radioimmuno-precipitation, enzyme-linked 
immunosorbent assay (ELISA) , dot blot or Western blot 
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assay, inhibition or competition assay, and sandwich 
assay. In addition, antibodies can be screened for 
binding to wild-type versus mutant zCTGF4 polypeptide. 

Alternative techniques for generating or 
5 selecting antibodies useful herein include in vitro 
exposure of lymphocytes to zCTGF4 protein or peptide, and 
selection of antibody display libraries in phage or 
similar vectors (for instance, through use of immobilized 
or labeled ZCTGF4 protein or peptide) . Genes encoding 

10 polypeptides having potential ZCTGF4 polypeptide binding 
domains can be obtained by screening random peptide 
libraries displayed on phage (phage display) or on 
bacteria, such as E. coli. Nucleotide sequences encoding 
the polypeptides can be obtained in a number of ways, such 

15 as through random mutagenesis and random polynucleotide 
synthesis. These random peptide display libraries can be 
used to screen for peptides which interact with a known 
target which can be a protein or polypeptide, such as a 
ligand or receptor, a biological or synthetic 

20 macromolecule, or organic or inorganic substances. 
Techniques for creating and screening such random peptide 
display libraries are known in the art (Ladner et al . , US 
Patent NO. 5,223,409; Ladner et al., US Patent NO, 
4,946,778; Ladner et al . , US Patent NO. 5,403,484 and 

25 Ladner et al., US Patent NO. 5,571,698) and random peptide 
display libraries and kits for screening such libraries 
are available commercially, for instance from Clontech 
(Palo Alto, CA) , Invitrogen Inc. (San Diego, CA) , New 
England Biolabs, Inc. (Beverly, MA) and Pharmacia LKB 

30 Biotechnology Inc. (Piscataway, NJ) . Random peptide 
display libraries can be screened using the ZCTGF4 
sequences disclosed herein to identify proteins which bind 
to zCTGF4 . These "binding polypeptides' 7 which interact 
with zCTGF4 polypeptides can be used for tagging cells; 
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for isolating homolog polypeptides by affinity 
purification; they can be directly or indirectly 
conjugated to drugs, toxins, radionuclides and the like. 
These binding polypeptides can also be used in analytical 
5 methods such as for screening expression libraries and 
neutralizing activity, e.g., for blocking interaction 
between ligand and receptor, or viral binding to a 
receptor. The binding polypeptides can also be used for 
diagnostic assays for determining circulating levels of 

10 zCTGF4 polypeptides; for detecting or quantitating soluble 
zCTGF4 polypeptides as marker of underlying pathology or 
disease. These binding polypeptides can also act as 
ZCTGF4 "antagonists" to block zCTGF4 binding and signal 
transduction in vitro and in vivo. These anti-zCTGF4 

15 binding polypeptides would be useful for inhibiting zCTGF4 
activity or protein-binding. 

Antibodies to zCTGF4 may be used for tagging 
cells that express zCTGF4; for isolating zCTGF4 by 
affinity purification; for diagnostic assays for 

20 determining circulating levels of zCTGF4 polypeptides; for 
detecting or quantitating soluble zCTGF4 as marker of 
underlying pathology or disease; in analytical methods 
employing FACS; for screening expression libraries; for 
generating anti-idiotypic antibodies; and as neutralizing 

25 antibodies or as antagonists to block zCTGF4 in vitro and 
in vivo. In particular, antibodies will be useful for 
diagnostics, due to the association of proteins of the 
present invention with extracellular matrix and vessels, 
and labeled proteins will be useful in the diagnosis of 

30 diseases such as bone marrow fibrosis, prevention of scar 
tissue formation, cutaneous lupus erythematosis , 
scleroderma, dermatositis , and end-stage kidney failure. 

Antibodies or polypeptides herein may also be 
directly or indirectly conjugated to drugs, toxins, 
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radionuclides and the like, and these conjugates used for 
in vivo diagnostic or therapeutic applications. Moreover, 
antibodies to zCTGF4 or fragments thereof may be used in 
vitro to detect denatured ZCTGF4 or fragments thereof in 
5 assays, for example, Western Blots or other assays known 
in the art. 

Antibodies or polypeptides herein can also be 
directly or indirectly conjugated to- drugs, toxins, 
radionuclides and the like, and these conjugates used for 

10 in vivo diagnostic or therapeutic applications. _ For 
instance, polypeptides or antibodies of the present 
invention can be used to identify or treat tissues or 
organs that express a corresponding anti- complementary 
molecule (receptor or antigen, respectively, for 

15 instance) . More specifically, zCTGF4 polypeptides or 
anti-zCTGF4 antibodies, or bioactive fragments or portions 
thereof, can be coupled to detectable or cytotoxic 
molecules and delivered to a mammal having cells, tissues 
or organs that express the anti -complementary molecule. 

2 0 For example, for use of the antibodies and polypeptides, 

labeled for detection by imaging technologies, will be 
useful for diagnosing diseases associated with 
extracellular matrix and vessels, such as, bone marrow 
fibrosis, aberrant hematopoiesis , prevention of scar 
25 tissue formation, cutaneous lupus erythematosis , 
scleroderma, dermatositis , and end- stage kidney failure. 

Suitable detectable molecules may be directly or 
indirectly attached to the polypeptide or antibody, and 
include radionuclides, enzymes, substrates, cof actors, 

3 0 inhibitors, fluorescent markers, chemiluminescent markers, 

magnetic particles and the like. Suitable cytotoxic 
molecules may be directly or indirectly attached to t.he 
polypeptide or antibody, and include bacterial or plant 
toxins (for instance, diphtheria toxin, Pseudomonas 
35 exotoxin, ricin, abrin and the like) , as well as 
therapeutic radionuclides, such as iodine-131, rhenium-188 



or yttrium- 90 (either directly attached to the polypeptide 
or antibody, or indirectly attached through means of a 
chelating moiety, for instance) . Polypeptides or 

antibodies may also be conjugated to cytotoxic drugs, such 
as adriamycin. For indirect attachment of a detectable or 
cytotoxic molecule, the detectable or cytotoxic molecule 
can be conjugated with a member of a complementary/ 
anticomplementary pair, where the other member is bound to 
the polypeptide or antibody portion. For these purposes, 
biotin/streptavidin is an exemplary complementary/ 
anticomplementary pair. 

In another embodiment, polypeptide- toxin fusion 
proteins or antibody- toxin fusion proteins can be used for 
targeted cell or tissue inhibition or ablation (for 
instance, to treat diseases caused by inappropriate growth 
of cells or tissues) . Such molecule fusion proteins thus 
represent a generic targeting vehicle for cell/tissue- 
specific delivery of generic anti- complementary- 
detectable/ cytotoxic molecule conjugates. 

In another embodiment, zCTGF4 -cytokine fusion 
proteins or antibody- cytokine fusion proteins can be used 
for enhancing in vivo killing of target tissues (for 
example , blood and bone marrow cancers) , if the zCTGF4 
polypeptide or anti-zCTGF4 antibody targets the 
hyperprolif erative blood or bone marrow cell (See, 
generally, Hornick et al., Blood 81:4437-47, 1997). They 
described fusion proteins enable targeting of a cytokine 
to a desired site of action, thereby providing an elevated 
local concentration of cytokine. Suitable ZCTGF4 

polypeptides or anti-zCTGF4 antibodies target an 
undesirable cell or tissue (i.e., a tumor or a leukemia), 
and the fused cytokine mediated improved target cell lysis 
by effector cells. Suitable cytokines for this purpose 
include interleukin 2 and granulocyte-macrophage colony- 
stimulating factor (GM-CSF) , for instance. 
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In another embodiment, if the zCTGF4 polypeptide 
or anti-zCTGF4 antibody targets vascular cells or tissues, 
such polypeptide or antibody may be conjugated with a 
radionuclide, and particularly with a beta-emitting 
radionuclide, to reduce restenosis. Such therapeutic 
approach poses less danger to clinicians who administer 
the radioactive therapy. For instance, iridium- 192 
impregnated ribbons placed into stented vessels of 
patients until s 'the required radiation dose was delivered 
showed decreased tissue growth in the vessel and greater 
luminal diameter than the control group, which received 
placebo ribbons. Further, revascularisation and stent 
thrombosis were significantly lower in the treatment 
group. Similar results are predicted with targeting of a 
bioactive conjugate containing a radionuclide, as 
described herein. 

The bioactive polypeptide or antibody conjugates 
described herein can be delivered intravenously, 
intraarterially or intraductally , or may be introduced 
locally at the intended site of action. 

Molecules of the present invention can be used 
to identify and isolate receptors involved in growth and 
differentiation of zCTGF4 responsive cells. For example, 
proteins and peptides of the present invention can be 
immobilized on a column and membrane preparations run over 
the column ( Immobilized Affinity Liaand. Techniques , 
Hermanson et al., eds., Academic Press, San Diego, CA, 
1992, pp. 195-202) . Proteins and peptides can also be 
radiolabeled ( Methods in Enzvmol . , vol. 182, "Guide to 
Protein Purification", M. Deutscher, ed. , Acad. Press, San 
Diego, 1990, 721-737) or photoaf f inity labeled (Brunner et 
al., Ann. Rev. Biochem, 62.: 483 -514, 1993 and Fedan et al . , 
Biochem . Pharmacol . 33.: 1167-1180 , 1984) and specific cell- 
surface proteins can be identified. 

The molecules of the present invention will be 
useful for regulating the growth and/or differentiation of 



zCTGF4 responsive cells. The polypeptides, nucleic acid 
and/or antibodies of the present invention can be used in 
treatment of disorders associated with unregulated growth 
in ZCTGF4- responsive tissues. In particular, the 

molecules of the present may used to produce antagonists 
to treat or prevent development of pathological conditions 
in tissues as testis, trachea, bone marrow, and kidney. 
Certain diseases such as bone marrow fibrosis, prevention 
of scar tissue formation, cutaneous lupus erythematosis , 
scleroderma, dermatositis , and end-stage kidney failure, 
may be amenable to such diagnosis, treatment* or 
prevention. 

Polynucleotides encoding zCTGF4 polypeptides are 
useful within gene therapy applications where it is 
desired to increase or inhibit zCTGF4 activity. If a 
mammal has a mutated or absent zCTGF4 gene, the zCTGF4 
gene can be introduced into the cells of the mammal. In 
one embodiment, a gene encoding a zCTGF4 polypeptide is 
introduced in vivo in a viral vector. Such vectors 

include an attenuated or defective DNA virus, such as, but. 
not limited to, herpes simplex virus (HSV) , 
papillomavirus, Epstein Barr virus (EBV) , adenovirus, 
adeno-associated virus (AAV), and the like. Defective 
viruses, which entirely or almost entirely lack viral 
genes, are preferred. A defective virus is not infective 
after introduction into a cell. Use of defective viral 
vectors allows for administration to cells in a specific, 
localized area, without concern that the vector can infect 
other cells. Examples of particular vectors include, but. 
are not limited to, a defective herpes simplex virus 1 
(HSV1) vector (Kaplitt et al . , Molec . Cell. Neurosci. 
2:320-30, 1991); an attenuated adenovirus vector, such as 
the vector described by Stratf ord-Perricaudet et al . , *L_ 
Clin. Invest. 90:626-30, 1992; and a defective adeno- 
associated virus vector (Samulski et al . , J. Virol. 
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61:3096-101, 1987; Samulski et al . , J. Virol. . 63:3822-28, 
1989) . 

In another embodiment, the zCTGF4 gene can be 
introduced in a retroviral vector, e.g., as described in 
Anderson et al., U.S. Patent No. 5,399,346; Mann et al . 
Cell 33:153, 1983; Temin et al . , U.S. Patent No. 
4 650,764; Temin et al . , U.S. Patent No. 4,980,289; 
Markowitz et al . , J. Virol- 62:1120, 1988; Temin et al., 
U.S. Patent No. 5,124,263; International Patent 
Publication No. WO 95/07358, published March 16, 1995 by 
Dougherty et al . ; and Kuo et al . , Blood 82 = 845-852, 1-993. 
Alternatively, the vector can be introduced by lipofectron 
in vivo using liposomes. Synthetic cationic lipids can be 
used to prepare liposomes for in vivo transfection of a 
gene encoding a marker (Feigner et al., Proc Natl. Acad. 
Scjb USA 84:7413-17, 1987; Mackey et al . , Proc. Natl. 
Acad ScjL USA 85:8027-31, 1988). The use of lipofection 
to introduce exogenous genes into specific organs in vivo 
has certain practical advantages. Molecular targeting of 
liposomes to specific cells represents one area of 
benefit. More particularly, directing transfection to 
particular cells represents one area of benefit. For 
instance, directing transfection to particular cell types 
would be particularly advantageous in a tissue wxth 
cellular heterogeneity, such as the pancreas, liver, 
kidney, and brain. Lipids may be chemically coupled to 
other molecules for the purpose of targeting. Targeted 
peptides (e.g., hormones or neurotransmitters), proteins 
such as antibodies, or non-peptide molecules can be 
coupled to liposomes chemically. 

It is possible to remove the target cells from 
the body; to introduce the vector as a naked DNA plasmid; 
and then to re-implant the transformed cells into the 
body Naked DNA vectors for gene therapy can be 

introduced into the desired host cells by methods known in 
the art, e.g., transfection, electroporatxon, 
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microinjection, transduction, cell fusion, DEAE dextran, 
calcium phosphate precipitation, use of a gene gun or use 
of a DNA vector transporter. See, e.g., Wu et al . , J . 
Biol. Chem. 267 : 963-67, 1992; Wu et al . , J. Biol. Chem. 
5 263:14621-24, 1988. 

Antisense methodology can be used to inhibit 
zCTGF4 gene transcription, such as to inhibit cell 
proliferation in vivo. Polynucleotides that are 

complementary v - to a segment of a zCTGF4 -encoding 
10 polynucleotide (e.g., a polynucleotide as set forth in SEQ 
ID N0:1) are designed to bind to zCTGF4 -encoding mRNA and 
to inhibit translation of such mRNA. Such antisense 

Mi 

polynucleotides are used to inhibit expression of zCTGF4 
Q polypeptide -encoding genes in cell culture or in a 

f7 15 subject. 

m The present invention also provides reagents 

which will find use in diagnostic applications. For 
a example, the ZCTGF4 gene, a probe comprising zCTGF4 DNA or 

H* RNA or a subsequence thereof can be used to determine if 

Ik-; 

PI 20 the ZCTGF4 gene is present on chromosome 6 or if a 
If | mutation has occurred. Detectable chromosomal aberrations 

at the zCTGF4 gene locus include, but are not limited to, 
aneuploidy, gene copy number changes, insertions, 
deletions, restriction site changes and rearrangements. 
25 Such aberrations can be detected using polynucleotides of 
the present invention by employing molecular genetic 
techniques, such as restriction fragment length 
polymorphism (RFLP) analysis, short tandem repeat (STR) 
analysis employing PCR techniques, and other genetic 
3 0 linkage analysis techniques known in the art (Sambrook et 
al., ibid . ; Ausubel et . al . , ibid . ; Marian, Chest 

108 :255-65, 1995) . 

Mice engineered to express the zCTGF4 gene, 
referred to as "transgenic mice," and mice that exhibit a 
35 complete absence of ZCTGF4 gene function, referred to as 
"knockout mice," may also be generated (Snouwaert et al., 
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Science 257:1083, 1992; Lowell et al., Nature 366 :740-42, 
1993; Capecchi, M.R., Science 244 : 1288-1292, 1989; 
Palmiter, R.D. et al . Annu Rev Genet. 20 : 465-499, 1986) . 
For example, transgenic mice that over-express zCTGF4, 
either ubiquitously or under a tissue-specific or tissue- 
restricted promoter can be used to ask whether over- 
expression causes a phenotype . For example, over- 
expression of a wild-type zCTGF4 polypeptide, polypeptide 
fragment or a ^mutant thereof may alter normal cellular 
processes, resulting in a phenotype that identifies a 
tissue in which zCTGF4 expression is functionally relevant 
and may indicate a therapeutic target for the zCTGF4, its 
agonists or antagonists. Moreover, such over-expression 
may result in a phenotype that shows similarity with human 
diseases. Similarly, knockout zCTGF4 mice can be used to 
determine where zCTGF4 is absolutely required in vivo. 
The phenotype of knockout mice is predictive of the in 
vivo effects of that a ZCTGF4 antagonist, such as those 
described herein, may have. The human zCTGF4 cDNA can be 
used to isolate murine zCTGF4 mRNA, cDNA and genomic DNA, 
which are subsequently used to generate knockout mice. 
These mice may be employed to study the zCTGF4 gene and 
the protein encoded thereby in an in vivo system, and can 
be used as in vivo models for corresponding human 
diseases. Moreover, transgenic mice expression of zCTGF4 
antisense polynucleotides or ribozymes directed zCTGF4, 
described herein, can be used analogously to transgenic 
mice described above. 

For pharmaceutical use, the proteins of the 
present invention are formulated for parenteral, 
particularly intravenous or subcutaneous, delivery- 
according to conventional methods. Intravenous 
administration will be by bolus injection or infusion over 
a typical period of one to several hours. In general, 



75 



pharmaceutical formulations will include a zCTGF4 protein 
in combination with a pharmaceutically acceptable vehicle, 
such as saline, buffered saline, 5% dextrose in water or 
the like. Formulations may further include one or more 
5 excipients, preservatives, solubilizers, buffering agents, 
albumin to prevent protein loss on vial surfaces, etc. 
Methods of formulation are well known in the art and are 
disclosed, for example, in Remington: The Science and 
Practice of Pharmacy , Gennaro, ed., Mack Publishing Co., 
10 Easton, PA, 19th ed., 1995. Therapeutic doses will 

generally be in the range of 0.1 to 100 (xg/kg of patient 

weight per day, preferably 0.5-20 |xg/kg per day, with the 
r;? exact dose determined by the clinician according to 

accepted standards, taking into account the nature and 
y, 15 severity of the condition to be treated, patient traits, 
tt* etc. Determination of dose is within the level of 

Jj ordinary skill in the art. The proteins may be 

administered for acute treatment, over one week or less, 
[7 often over a period of one to three days or may be used in 

l!i 20 chronic treatment, over several months or years. The 
i| invention is further illustrated by the following non- 

-j, limiting examples. 

The invention is further illustrated by the 

following non-limiting examples. 

25 

EXAMPLES 

Example 1 

3 0 Scanning of a translated DNA database using the 

CTGF family motifs as a query resulted in identification 
of an EST sequence found to have some homology to the 
connective tissue growth factor family. 

Plasmid DNA was isolated from a clone 

3 5 corresponding to the EST, which had been designated 
zCTGF4, and was analyzed for polynucleotide sequence. 
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Upon alignment of the ZCTGF4 cDNA and CTGF family, it was 
revealed that the clone contained a truncated cDNA 
sequence with an intron at the 5 1 end of the sequence . 

Based on the tissue distribution from Northern 
5 blot analysis (see Example 2) , a cDNA library was 
constructed from human testis, and used to screen for a 
full length clone. The library was calculated to contain 
approximately 10 6 clones. The master plate containing 80 
pools (each pool representing 12 pools of 1.25 x 10 4 
10 clones) was screened using PCR to determine whether zCTFG4 
cDNA was present. Reactions were set up using: 1 (1*1 of 

each pool, 2 0 pmoles each of oligonucleotide primers 
ZC14,882 and ZC14,883 (SEQ ID NOS: 6 and 7, respectively), 
and 1 U of ExTaq® DNA polymerase (TaKaRa Shuzo Co., Ltd., 

15 Shiga, JP) > in a 25 [il volume. PCR was performed in a 96- 

well plate on a GenAmp PCR system 9700 (PE Applied 
Biosystems, Cheshire, UK) . The reaction was run as 
follows: 94°C for 1.5 minute, then for 30 cycles of 94°C, 
15 seconds; 55°C 2 0 seconds; 72°C 3 0 seconds; and ended 

20 with a 7 minute incubation at 72°C. Pools designated A3, 
A6, D3, D6 and F7 were positive for the presence of ZCTGF4 
DNA, and selected for further analyses. 

Positive pools were further analyzed with PCR to 
identify clones with longer 5 1 end sequence. The reaction 

25 mixture contained: 1 \xl of each pool, 20 pmoles each of 

oligonucleotide primers ZC15,909 (corresponding to a 
vector sequence) and ZC14,885 (corresponding to a gene 
specific sequence) (SEQ ID NOS: 8 and 9, respectively), 
and 1 U of mixture ExTaq (TaKaRa) and Pfu® (Stratagene, 

30 La Jolla, CA) (2:1) . DNA polymerase, in a 25 |il total 
volume. PCR was performed on a GenAmp PCR system 2400 (PE 
Biosystems) as follows: 94°C for 1.5 minute, then for 25 
cycles of 94°C, 15 seconds; 55°C 20 seconds; 72°C 3 0 
seconds; 7 minute incubation at 72°C. A second nested 

3 5 anchor PCR was performed using 1 \il of 1/50 diluted first 
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round PCR products as template, 2 0 pmoles each of 
oligonucleotide primers ZC15,911 (corresponding to vector 
sequence) and ZC14,884 (corresponding to gene specific 
sequence) (SEQ ID NOS : 10 and 11, respectively), and 1 U 
5 of mixture ExTaq (TaKaRa) and Pfu® (Stratagene) (2:1) DNA 

polymerase in a 25 jxl total volume. The reaction was run 
as follows: 94°C for 1.5 minute, then for 25 cycles of 
94°C, 15 seconds; 50°C 20 seconds; 72°C 3 0 seconds; 7 
minute incubation at 72°C. PCR products from A3, AG , D3 

10 and D6 were gel purified with QIAquick Gel Extraction kit 
(Qiagen Inc. Chatsworth, CA) , subcloned into a pCR2 . 1 
vector of TA Cloning® kit (Invitrogen, Carlsbad, CA) , 
which provides for direct ligation into expression vectors 
(Mead et al . , Bio /Technology 9 (7) :657-663, 1991), and 

15 designated as CTGF4a3, CTGF4a6, CTGF4d3 , CTGF4d6 . 

Sequence analysis of CTGF4a3, CTGF4a6, CTGF4d3, 
and CTGF4d6 revealed that the sequence of CTGF4a3 encoded 
a start Met with a putative signal peptide, a long open 
reading frame (ORF) at which the first reading frame was 

20 interrupted by a stop codon, and a short ORF thereafter at 
the second reading frame. The short ORF was identical to 
the 5 ! end coding region of zCTGF4, while the long ORF had 
a 45% sequence similarity to human CTGF. The sequence of 
CTGF4a6 was almost " identical to CTGF4a3 except it was 

25 short 16 N- terminal amino acids in the secretory signal 
peptide. Both CTGF4d3 and CTGF4d6 had smaller inserts. 
All four clones encoded a long ORF interrupted by a stop 
codon at same place . 

A PCR approach, as described previously, was 

30 used to find a correctly coded sequence in other tissues. 
A human bone marrow marathon cDNA was used based on the 
results of northern blot analysis (see Example 2) . cDNA 
was made using Clontech Marathon™ cDNA kit (Clontech, 
Palo Alto, CA) , according to the manufacturer's 

35 specifications. 5 |xl of 1/100 diluted Marathon™ bone 
marrow cDNA, 2 0 pmoles each of oligonucleotide primers 
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ZC15,910 and ZC14,885 (SEQ ID NOS : 12 and 9, 
respectively) , and 1 U of ExTaq (TaKaRa) and Pfu® 

(Stratagene) (2:1) DNA polymerase were used in a 25 jil of 
reactions. The reactions were run as follows: 94°C for 
1.5 minute, then for 25 cycles of 94°C, 15 seconds; 54°C 20 
seconds; 72°C 3 0 seconds; and ended with a 7 minute 
incubation at 72°C. 1 \il of 1/50 diluted first PCR 
product was used as template for nested PCR. 2 0 pmoles 
each of oligonucleotide primers ZC16 / 192 and ZC14 / 884 (SEQ 
ID NOS: 13 and 11, respectively) , and 1 U of ExTaq 

(TaKaRa) and Pfu® (Stratagene) (2:1) DNA polymerase -were 

used in a 25 jil of reactions. The reactions were run as 

follows: 94°C for 1.5 minute, then for 25 cycles of 94°C / 
15 seconds; 50°C 20 seconds; 72°C 30 seconds; and ended 
with a 7 minute incubation at 72°C. The PCR product was 
gel purified, subcloned into pCR2 . 1 vector (as described 
previously) , and sequenced. The sequence analysis 

revealed that the 5 1 and 3 1 end of bone marrow clone 
overlapped with the testis clones, with a discrepancy in 
the region having the reading frame shift in the testis 
clone. The sequence of bone marrow clone had a correct 
ORF, and was included in the composite full length 
sequence of zCTGF4 . The initially identified EST, which 
originated from a penis library, as well as clones 
identified from two testis libraries, had introns or 
reading frame shifts in the same region. Further analysis 
revealed that this region was least conserved area when 
compared to other members in the family, suggesting the 
reading frame shift may be a regulatory mechanism. 

Example 2 

Northern analyses were performed using Human 
Multiple Tissue Blots I, II and III from Clontech (Palo 
Alto, CA) . A probe was generated from a gel purified PCR 
product made from ZC14,8 83 (SEQ ID NO: 7) and ZC14,882 



(SEQ ID NO: 6) as primers and zCTGF4 as template, thac had 
been radioactively labeled with RED I PR I ME™ DNA labeling 
kit (Amersham, Arlington Heights, IL) according to the 
manufacturer's suggestion. The probe was purified using a 
NUCTRAP push column (Stratagene) . EXPRESSHYB™ (Clontech) 
solution was used for prehybridization and as a 
hybridizing solution for the Northern blots. 
Hybridization took place overnight at 65°C, and the blots 
were then washed in 2X SSC and 0.05% SDS at RT, followed 
by a wash in 0 . IX SSC and 0.1% SDS at 50°C. One major 
transcript was observed at size of 1.4 kb. Signals were 
present in testis, bone marrow, trachea, kidney, liver, 
stomach, small intestine, ovary, placenta, prostate and 
spinal cord. The expression of zCTGF4 was also examined 
with Human RNA Master blot (Clontech) with a probe 
generated from a PCR product amplified with ZC16,192 (SEQ 
ID NO: 13) and ZC14,884 (SEQ ID NO: 11) as primers and 
zCTGF4 as template DNA. The conditions for probe labeling 
and hybridization were the same as described above. 
zCTGF4 was positive in two other tissues, mammary gland 
and fetal kidney, in addition to those described above. 



Example 3 

zCTGF4 was mapped to chromosome 6 using the 
commercially available "GeneBridge 4 Radiation Hybrid 
Panel" (Research Genetics, 'Inc., Huntsville, AL) . The 
GeneBridge 4 Radiation Hybrid Panel contained DNAs from 
each of 93 radiation hybrid clones, plus two control DNAs 
(the HFL donor and the A23 recipient) . A publicly 
available WWW server (http://www-genome.wi.mit.edu/cgi- 
bin/contig/rhmapper .pi) allowed mapping relative to the 
Whitehead Institute/MIT Center for Genome Research's 
radiation hybrid map of the human genome (the "WICGR" 
radiation hybrid map) which was constructed with the 
GeneBridge 4 Radiation Hybrid Panel. 
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For the mapping of ZCTGF4 with the "GeneBridge 4 
RH Panel", 20 /xl reactions were set up in a 96-well 
microtiter plate (Stratagene, La Jolla, CA) , and used in a 
"RoboCycler Gradient 96" thermal cycler (Stratagene) . 
5 Each of the 95 PCR reactions consisted of 2 /xl 10X KlenTaq 
PCR reaction buffer (CLONTECH Laboratories, Inc., Palo 
Alto, CA) , 1.6 /xl dNTPs mix (2.5 mM each, PERKI N - ELMER , 
Foster City, CA) , 1 /xl sense primer, ZC15,089 (SEQ ID NO: 
' 14), 1 /xl« antisense primer, ZC15,092 (SEQ ID NO: 15), 2 /xl 
10 "RediLoad" (Research Genetics, Inc., Huntsville, AL) , 0.4 
/xl SOX Advantage KlenTaq Polymerase Mix (Clontech 
Laboratories, Inc.), 25 ng of DNA from an individual 
Li t hybrid clone or control and ddH 2 0 for a total volume of 20 

5f ul. The reactions were overlaid with an equal amount of 

u, 15 mineral oil and sealed. The PCR cycler conditions were as 
N ; follows: an initial 1 cycle 5 minute denaturation at 95°C 

jjl , 3 5 cycles of a 1 minute denaturation at 95°C, 1 minute 

* annealing at 56°C and 1.5 minute extension at 72°C, 

followed by a final 1 cycle extension of 7 minutes at 
N* 2 0 72°C. The reactions were separated by electrophoresis on a 
Tfi 2% agarose gel (GIBCO-BRL Life Technologies, Gaithersburg, 

O MD) . 

Hi The results showed that zCTGF4 maps 6.29 cR_3000 

from the framework marker WI- 4792 on the WICGR chromosome 

25 6 radiation hybrid map. Proximal and distal framework 
markers were WI-4792 and CHLC . GATA3 1.100, respectively. 
The use of surrounding markers positions ZCTGF4 in the 
6q22.1 region on the integrated LDB chromosome 6 map (The 
Genetic Location Database, University of Southhampton, WWW 

3 0 server: http: //cedar. genetics . soton.ac.uk/public_html/) . 

Example 4 

~~ Several expression constructs were made for 
eukaryotic expression of the zCTGF4 cDNA. A mammalian 
3 5 expression vector was constructed with the dihyrofolate 
reductase gene under control of the SV4 0 early promoter, 
SV4 0 polyadenylation site, a cloning site to insert the 
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gene of interest under control of the MT-1 promoter and 
the hGH polyadenylation site. The expression vector was 
designated pZP9 and is deposited at the American Type 
Culture Collection, 12301 Parklawn Drive, Rockville, MD. 
5 To facilitate purification, pZP9 vector was modified by 
addition of the tPA leader sequence (U.S. Patent 
5,641,655, incorporated herein by reference) and a GluGlu 
tag (SEQ ID NO: 16) between the MT-1 promoter and hGH 
terminator. The tPA leader replaces the native secretory 
10 signal sequence for DNAs encoding for polypeptides of 
interest that are inserted into this vector, • and 
expression results in an N- terminally tagged protein. ■ The 
N- terminally tagged vector was designated pZP9NEE. 
Another vector was similarly constructed with a C- terminal 
15 GluGlu tag (SEQ ID NO: 16) inserted just 5' to the hGH 
terminator and utilizes the native (or other fused) 
secretory signal sequence for secretion of the encoded 
polypeptide of interest, and expression resulted in a C- 
terminally tagged protein. The C- terminal GluGlu tagged 
20 vector was designated pZP9CEE. 

A 5' DNA fragment containing the zCTGF4 coding 
region (nucleotide 86 to nucleotide 613 of SEQ ID NO: 1) 
was generated by PCR using oligonucleotide primers 
ZC16,422 (SEQ ID NO: 17) and ZC16,424 (SEQ ID NO: 18) and 
25 CTGF4A3 as the template, and a 3' DNA fragment including 
nucleotide 590 to nucleotide 1078 of SEQ ID NO : 1 and 
oligonucleotides ZC16,421 (SEQ ID NO: 19) and ZC16,425 
(SEQ ID NO: 20) as primers and zCTGF4 as template. The 
PCR reactions were run as follows: 1 cycle at 94°C for 1.5 
minutes; 3 cycles of 94°C for 15 seconds; 50°C for 30; 72°C 
for 3 0 seconds; 12 cycles of 94°C for 15 seconds; 50°C for 
20 seconds; 72°C for 3 0 seconds and 1 cycle at 72°C for 2 
minutes. The PCR products were gel purified and mixed 
together as the template for the following PCR reaction: 1 
cycle at 94°C for 1.5 minutes; 3 cycles of 94°C for 15 
seconds; 54°C for 20; 68°C for 45 seconds; 20 cycles of 



30 



35 
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94°C for 15 seconds; 6 8°C for 45 seconds; and 1 cycle at 
72°C for 2 minutes. This PCR product, which contained the 
entire polynucleotide sequence encoding for the mature 
zCTGF4 polypeptide, was gel purified and restriction 
5 digested with BamH I and Xho I for use with the N- 
terminally tagged vector, pZP9NEE r described above. The 
zCTGF4 DNA sequence .was ligated into the vector pZP9NEE 
and E. coli transf ormants selected, and designated 
pZP9NEE/zCTGF4 ... 

10 Plasmid DNA was isolated and the region of the 

plasmid containing polynucleotides encoding for the" tPA 
leader followed by the DNA encoding the GluGlu tag (SEQ ID 
NO: 16) and mature CTGF4 polypeptide were excised using a 
5 ! EcoR I and a 3 1 Xbal site. The insert was sequence 

15 analyzed for verification. 

A similar cloning method was used to generate a 
C- terminally tagged expression construct. This construct, 
designated pZP9CEE/zCTGF4 comprises the pZP9CEE vector 
with DNAs encoding for the native zCTGF4 secretory signal 

2 0 sequence, the mature zCTGF4 polypeptide (shown in SEQ ID 

NO: 1) and a C- terminal GluGlu tag (SEQ ID NO: 16) . 

The fragment from pZP9NEE/zCTGF4 containing DNA 
encoding for the the tPA secretory signal peptide, the 
GluGlu tag and the mature ZCVTGF4 were ligated into a 
25 baculovirus vector designated pZBV4L. pZBV4L is a 

baculovirus expression vector derived from . the FASTBAC 
vector of the Bac-to-Bac™ system (GIBCO-BRL, 

Gaithersburg, MD) and described in Luckow et al . , J, 
Virol . 67:4566-4579, 1993. The pFASTBAC vector was 

3 0 modified by removing the polyhedrin promoter and 

substituting the baculovirus basic protein promoter (Hill- 
Perkins et al . , J. Gen. Virol. 71:971-976, 1990; Bonning 
et al., J. Gen. Virol. 75:1551-1556, 1994; and Chazenbalk 
et al., J. Biol. Chem. 270 :1543-154 9, 1995). 
35 One microliter of zCTGF4NEE/pZBV4L expression 

construct is used to transform 2 0 jal DHlOBac (GIBCO-BRL, 
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Gaithersburg, MD) in 980 jil SOC (2% Bacto Tryptone, 0.5% 
Bacto Yeast Extract, 10 ml 1M NaCl, 1.5 mM KC1, 10 mM 
MgCl 2 , 10 mM MgS0 4 and 2 0 mM glucose) , according to the 
manufacturer's specifications. The cells are incubated 4 8 
hours at 3 7°C and 2 colonies in which virus had 
incorporated the plasmid (referred to as a "bacmid") were 
identified (white in color) , and are isolated. Bacmid DNA 
is isolated and used to transfect Spodoptera frugiperda 
(Sf 9) cells using transfect ion liposomes from Cellf ectin 
(GIBCO-BRL) . The cells are cultured at 27°C in shake 
flasks using 50-100 ml of Sf900II serum free medium 
(GIBCO-BRL) . After 3-4 days medium conditioned by the 
virus is harvested and used to infect Sf9 cells in mid- log 
growth at approximately 1E6 cells/ml. 

The cells are scaled up by adding cell cultures 
to volumes of 15 liter when cells have achieved a density 
between 1-2E6 cell/ml and then infected with an MOI of 1- 
3. After 2 days at 27°C, the medium containing the 
protein and virus is harvested. 

Example 5 

A. Affinity Tagged zCTGF4 Protein 

zCTGF4 expressed with an N-terminal or C- 

terminal GluGlu (EE) tag is purified as follows: A 

mixture of protease inhibitors is added to a 2000 ml 

sample of conditioned media from baculovirus- infected Sf9 

cells to final concentrations of 2.5 mM 

ethylenediaminetetraacetic acid (EDTA, Sigma Chemical Co. 

St. Louis, MO), 0.001 mM leupeptin (Boehringer -Mannheim, 

Indianapolis, IN), and 0.001 mM pepstatin (Boehringer- 

Mannheim) and 0.4 mM Pefabloc (Boehringer-Mannheim) . 

The sample is centrifuged at 10,000 rpm for 30 min at 4°C 

in a Beckman JLA-10.5 rotor (Beckman Instruments, Palo 

Alto, CA) in a Beckman Avanti J25I centrifuge (Beckman 

Instruments) to remove cell debris. To the supernatant 

fraction is added a 50.0 ml sample of anti-EE Sepharose, 
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prepared as described below, and the mixture is gently 
agitated on a Wheaton (Millville, NJ) roller culture 
apparatus for 18.0 h at 4°C. 

The mixture is poured into a 5.0 x 20.0 cm 
Econo-Column (Bio-Rad, Laboratories, Hercules, CA) and the 
gel is washed with 3 0 column volumes of phosphate buffered 
saline (PBS) . The unretained flow- through fraction is 
discarded. When the absorbance of the effluent at 280 nM 
is less than 0.05, flow through the column is reduced to 
zero and the anti-EE Sepharose gel is washed with 2.0 
column volumes of PBS containing 0.2 mg/ml of EE peptide 
(AnaSpec, San Jose, CA) . The peptide used has ■ the 
sequence GluTyrMetGlu (SEQ ID NO: 16). After 1.0 h at 
4°C, flow is resumed and the eluted protein is collected. 
This fraction is the peptide elution. The anti-EE 
Sepharose gel is washed with 2 . 0 column volumes of 0 . 1M 
glycine, pH 2.5, and the glycine wash is collected 
separately. The pH of the glycine-eluted fraction is 
adjusted to 7.0 by the addition of a small volume of 10X 
PBS and stored at 4°C. 

The peptide elution is concentrated to 5.0 ml 
using a 15,000 molecular weight cutoff membrane 
concentrator (Millipore, Bedford, MA) , according to the 
manufacturer's instructions. The concentrated peptide 
elution is separated from free peptide by chromatography 
on a 1.5 x 50 cm Sephadex G-50 (Pharmacia, Piscataway, 
NJ) column equilibrated in PBS at a flow rate of 1.0 
ml/min using a BioCad Sprint HPLC (PerSeptive BioSystems, 
Framingham, MA) . Two-ml fractions are collected and the 
absorbance at 280 nM is monitored. The first peak of 

material absorbing at 280 nM and eluting near the void 
volume of the column is collected. This fraction is pure 
N-terminally tagged zCTGF4 or C-terminally tagged zCTGF4 . 
The pure material is concentrated as described above, 
analyzed by SDS-PAGE and Western blotting with antiEE 
antibodies, aliquoted, and stored at -80°C. 
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Preparation of anti-EE Sepharose is done as 
follows: A 100 ml bed volume of protein G-Sepharose 
(Pharmacia, Piscataway, NJ) is washed 3 times with 10 0 ml 
of PBS containing 0.02% sodium azide using a 500 ml 
Nalgene 0.45 micron filter unit. The gel is washed with 
6.0 volumes of 200 mM triethanolamine, pH 8.2 (TEA, Sigma, 
St. Louis, MO), and an equal volume of EE antibody- 
solution containing 900 mg of antibody is added. After an 
overnight incubation at 4 ° C , unbound ant ibody is removed 
by washing the resin with 5 volumes of 20 0 mM TEA as 
described above. The resin is resuspended in 2 volumes of 
TEA, transferred to a suitable container, 1 and 
dimethylpimilimidate-2HCl (Pierce, Rockford, IL) , 
dissolved in TEA, is added to a final concentration of 36 
mg/ml of gel . The gel is rocked at room temperature for 
45 min and the liquid is removed using the filter unit as 
described above. Nonspecific sites on the gel are then 
blocked by incubating for 10 min. at room temperature with 
5 volumes of 2 0 mM ethanolamine in 200 mM TEA. The gel is 
then washed with 5 volumes of PBS containing 0.02% sodium 
azide and stored in this solution at 4°C. 

B. Untagged zCTGF4 Protein 

Protease inhibitors are added to the conditioned 

media of baculovirous- infected Sf9 cells and the media 
will be centrifuged as described above for the EE-tagged 
proteins. The supernatant fraction is applied to a 50.0 
ml column of POROS HE1 (PerSeptive BioSystems, Framingham, 
MS) pre-equilibrated in 20 mM Tris-HCl, 50 mM NaCl, pH 7.4 
at a flow rate of 2.0 ml/min with in-line dilution (three- 
fold final dilution) with water as diluent using a BioBad 
Sprint HPLC (PerSeptive BioSystems, Framingham, MS) . 
Heparin-bound proteins are eluted with a 0.1-1.0 M 
gradient of NaCl . Protein-containing fractions are 

identified by absorbance at 280 nM and by SDS-PAGE. 
Fractions containing zCTGF4 are identified by a band on 
SDS-PAGE gels of the appropriate molecular weight for 
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glycosylated zCTGF4 of approximately 4 0 kDa The heparin- 
bound zCTGF4 pool is concentrated, applied to a Sephadex- 
G50 or a -G100 column, and eluted as described above. 
Purified zCTGF4 is characterized by SDS-PAGE, amino acid 
5 analysis, and N- terminal sequencing. 

Example 6 

A. Adenoviral Expression of zCTGF4 

The N . protein coding region of zCTGF4 was 
10 amplified by PCR using primers that added Fsel and AscI 

restriction sties at the 5' and 3' termini respectively. 

PCR primers ZC17948 (SEQ. ID. NO: 29) and ZC17949 (SEQ ID 
!^ ( NO: 30) were used with a template containing the full- 

er length zCTGF4 cDNA in a PCR reaction as follows: one cycle 

2 15 at 95°C for 5 minutes; followed by 15 cycles at 95°C for 1 
M* min., 58°C for 1 min., and 72°C for 1.5 min.; followed by 

rj:| 72°C for 7 min.; followed by a 4°C soak. The PCR reaction 

£i product was loaded onto a 1.2 % (low melt) SeaPlaque GTG 

:\ (FMC, Rockland, ME) gel in TAE buffer. The zCTGF4 PCR 

1 

y : 20 product was excised from the gel and purified using the 
p QIAquick™ PCR Purification Kit gel cleanup kit as per kit 

p instructions (Qiagen) . The PCR product was then digested 

H : with Fsel-AscI, phenol/chloroform extracted, EtOH 

precipitated, and rehydrated in 20ml TE (Tris/EDTA pH 8 ) . 
25 The 1065 bp zCTGF4 fragment was then ligated into the 
Fsel-AscI sites of the transgenic vector pTG12-8 (See, 
description herein) and transformed into DH10B competent 
cells by electroporation . Clones containing zCTGF4 were 
identified by plasmid DNA miniprep followed by digestion 
30 with Fsel-AscI. -A positive clone was confirmed by direct 
sequencing . 

B . Preparation of DNA construct for generation of 
Adenovirus 

35 The 1065 bp zCTGF4 cDNA was released from a 

TG12-8 vector using Fsel and AscI enzymes. The cDNA was 
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isolated on a 1% low melt SeaPlaque GTG™ (FMC, Rockland, 
ME) gel, and was then excised from the gel. The gel slice 
was melted at 70°C, extracted twice with an equal volume 
of Tris buffered phenol, and EtOH precipitated. The DNA 
5 was resuspended in lOjxl H 2 0. 

The zCTGF4 cDNA was cloned into the Fsei-AscI 
sites of a modified pAdTrack CMV (He et al., PNAS 95:2509- 
2514, 1998). This construct contains the GFP marker gene. 
The CMV promoter driving GFP expression was replace with 

10 the SV40 promoter and the SV40 polyadenylation signal was 
replaced with the human growth hormone polyadenylation 
signal. In addition, the native polylinker was replaced 
with Fsel, EcoRV, and AscI sites. This modified form of 
pAdTrach CMV was named pZyTrack. Ligation was performed 

15 using the Fast-Link™ DNA ligation and screening kit 
(Epicentre Technologies,- Madison, WI) . In order to 

linearize the plasmid, approximately 5 jig of the pZyTrack 
zctgf4 plasmid was digested with Pmel. Approximately 1 \ig 
of the linearized plasmid was cotransf ormed with 200ng of 

20 supercoiled pAdEasy (He et al., supra . ) into BJ5183 cells. 
The co-transformation was done using a Bio-Rad Gene Pulser 
' at 2.5kV, 200 ohms and 25mFa. The entire co- 

transformation was plated on 4 LB plates containing 25 
jig/ml kanamycin. The smallest colonies were picked and 

25 expanded in LB/kanamycln and recombinant adenovirus DNA 
identified by standard DNA miniprep procedures. Digestion 
of the recombinant adenovirus DNA with Fsei-AscI confirmed 
the presence of zCTGF4 . The recombinant adenovirus 
miniprep DNA was transformed into DH10B competent cells 

30 and DNA prepared using a Qiagen maxi prep kit as per kit 
instructions . 

C. Transfection of 293A Cells with Recombinant DNA 

Approximately 5 |Lig of recombinant adenoviral DNA 
35 was digested with Pad enzyme (New England Biolabs) for 3 
hours at 37°C in a reaction volume of 100 p,l containing 
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20-30U of Pad. The digested DNA was extracted twice with 
an equal volume of phenol/chloroform and precipitated with 
ethanol. The DNA pellet was resuspended in 10p,l distilled 
water. A T25 flask of QB1-293A cells {Quantum 
Biotechnologies, Inc. Montreal, Qc. Canada), inoculated 
the day before and grown to 60-70% confluence, were 
transfected with the -Pad digested DNA. The Pad-digested 
DNA was diluted up to a total volume of 50jutl with sterile 
HBS (150mM NaCl, 20mM HEPES) . In a separate tube, 20 |xl 
DOTAP (Boehringer Mannheim, lmg/ml) was diluted to a total 
volume of lOOjxl with HBS. The DNA was added to the DOTAP , 
mixed gently by pipeting up and down, and left at room 
temperature for 15 minutes. The media was removed from 
the 293A cells and washed with 5 ml serum-free MEM-alpha 
(Gibco BRL) containing 1 mM Sodium Pyruvate (GibcoBRL) , 
0.1 mM MEM non-essential amino acids (GibcoBRL) and 25 mM 
HEPES buffer (GibcoBRL) . 5 ml of serum-free MEM was added 
to the 2 93A cells and held at 37°C. The DNA/lipid mixture 
was added drop-wise to the T25 flask of 293A cells, mixed 
gently and incubated at 37°C for 4 hours. After 4 h the 
media containing the " DNA/lipid mixture was aspirated off 
and replaced with 5 ml complete MEM containing 5% fetal 
bovine serum. The transfected cells were monitored for 
Green Fluorescent Protein (GFP) expression and formation 
of foci, i.e., viral plaques ♦ 

Seven days after transfection of 293A cells with 
the recombinant adenoviral DNA, the cells expressed the 
GFP protein and started to form foci. These foci are 
viral "plaques" and the crude viral lysate was collected 
by using a cell scraper to collect all of the 293A cells. 
The lysate was transferred to a 50ml conical tube. To 
release most of the virus particles from the cells, three 
freeze/thaw cycles were done in a dry ice/ethanol bath and 
a 37° waterbath. 
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D. Amplification of Recombinant Adenovirus (rAdV) 

The crude lysate was amplified (Primary (1°) 

amplification) to obtain a working "stock" of zCTGF4 rAdV 
lysate. Ten 10cm plates of nearly confluent (80-90%) 293A 
5 cells were set up 20 hours previously, 200ml of crude rAdV 
lysate added to each 10cm plate and monitored for 48 to 72 
hours looking for CPE under the white light microscope and 
expression of GFP under the fluorescent microscope. When 
all of the 293A cells showed CPE (Cytopathic Effect) this 

10 1° stock lysate was collected and freeze/thaw cycles 
performed as described under Crude rAdV Lysate. 

Secondary (2°) Amplification of zCTGF4 rAdV was 
obtained as follows: Twenty 15 cm tissue culture dishes 
of 293A cells were prepared so that the cells were 80-90% 

15 confluent. All but 20 mis of 5% MEM media was removed and 
each dish was inoculated with 300-500 ml 1° amplified rAdv 
lysate. After 48 hours the 293A cells were lysed from 
virus production and this lysate was collected into 250 ml 
polypropylene centrifuge bottles and the rAdV purified. 

20 

E. AdV/cDNA Purification 

NP-40 detergent was added to a final 
concentration of 0.5% to the bottles of crude lysate in 
order to lyse all cells. Bottles were placed on a 

25 rotating platform for 10 min. agitating as fast as 
possible without the bottles falling over. The debris was 
pelleted by centrif ugation at 20,000 X G for 15 minutes. 
The supernatant was transferred to 250 ml polycarbonate 
centrifuge bottles and 0.5 volumes of 20% PEG8000/2.5M 

30 NaCl solution added. The bottles were shaken overnight on 
ice. The bottles were centrifuged at 20,000 X G for 15 
minutes and supernatant discarded into a bleach solution. 
The white precipitate in two vertical lines along the wall 
of the bottle on either side of the spin mark is the 

35 precipitated virus/PEG. Using a sterile cell scraper, the 
precipitate from 2 bottles was resuspended in 2.5 ml PBS. 
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The virus solution was placed in 2 ml microcentrifuge 
tubes and centrifuged at 14, 000 X G in the microfuge for 
10 minutes to remove any additional cell debris. The 
supernatant from the 2ml microcentrifuge tubes was 
transferred into a 15ml polypropylene snapcap tube and 
adjusted to a density of 1.34 g/ml with cesium chloride 
(CsCl) . The volume of the virus solution was estimated and 
0.55 g/ml of CsCl added. The CsCl was dissolved and 1 ml 
of this solution weighed 1.34 g. The solution was 
transferred polycarbonate thick-walled centrifuge tubes 
3.2ml (Beckman) and spin at 80,000rpm (348,000 X G) for 3- 
4 hours at 25°C in a Beckman Optima ' TLX 
microultracentrifuge with the TLA-100.4 rotor. The virus 
formed a white band. Using wide-bore pipette tips, the 
virus band was collected. 

The virus from the gradient has a large amount 
of CsCl which must be removed before it can be used on 
cells. Pharmacia PD-10 columns prepacked with Sephadex G- 
25M (Pharmacia) were used to desalt the virus preparation. 
The column was equilibrated with 20 ml of PBS. The virus 
was loaded and allowed to run into the column. 5 ml of 
PBS was added to the column and fractions of 8-10 drops 
collected. The optical densities of 1:50 dilutions of 
each fraction was determined at 2 60 nm on a 
spectrophotometer. A clear absorbance peak was present 
between fractions 7-12. These fractions were pooled and 
the optical density (OD) of a 1:25 dilution determined. A 
formula is used to convert OD into virus concentration: 
(OD at 260nm) (25) (1.1 x 10 12 ) = virions/ml. The OD of a 
1:25 dilution of the zctgf4 rAdV was 0.221, giving a virus 
concentration of 6 X 10 virions/ml. 

To store the virus, glycerol was added to the 
purified virus to a final concentration of 15%, mixed 
gently but effectively, and stored in aliquots at -80°C. 
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F. Tissue Culture Infectious Dose at 50% CPE (TCID 50) 
Viral Titration Assay 

A protocol developed by Quantum Biotechnologies, 
Inc. (Montreal, Qc. Canada) was followed to measure 
5 recombinant virus infectivity. Briefly, two 96-well 
tissue culture plates were seeded with 1X10 4 293A cells 
per well in MEM containing 2% fetal bovine serum for each 
recombinant virus to be assayed. After 24 hours 10-fold 
dilutions of each virus from 1X10" 2 to 1X10" 14 were made in 

10 MEM containing 2% fetal bovine serum. 100[j,l of each 
dilution was placed in each of 20 wells. After 5 days at 
37°C, wells were read either positive or negative 1 for 
Cytopathic Effect (CPE) and a value for "Plaque Forming 
Units/ml" (PFU) is calculated. 

15 TCID 50 formulation used was as per Quantum 

Biotechnologies, Inc., above. The titer (T) is determined 
from a plate where virus used is diluted from 10 " to 
10" 14 , and read 5 days after the infection. At each 
dilution a ratio (R) of positive wells for CPE per the 

20 total number of wells is determined. 

To Calculate titer of the undiluted virus 
sample: the factor, "F" = l+d(S-0.5); where XN S" is the sum 
of the ratios (R) ; and X M" is LoglO of the dilution 
series, for example, "d" is equal to 1 for a ten-fold 

25 dilution series. The titer of the undiluted sample is T = 
10 u+f) = TCID 50 /ml. To convert TCID 50 /ml to pfu/ml, 0.7 is 

subtracted from the exponent in the calculation for titer 

10 

(T) . The zCTGF4 adenovirus had a titer of 7.1 X 10 
pfu/ml . 

30 

Example 7 

Transgenic Expression 

35 Transgenic animals expressing zCTGF4 genes were 

made using adult, (B6D2fl, 2-8 months, (Taconic Farms)), 
prepubescent fertile females (donors) (B6C3fl, 4-5 weeks, 



92 



(Taconic Farms)) and adult fertile females (B6D2fl, 2-4 
months, (Taconic Farms) as parents. 

The donors were injected with approximately 8 
IU/mouse of Pregnant Mare's Serum gonadotrophin {Sigma, 
St. Louis, MO) I. P., and 46-47 hours later, 8 IU/mouse of 
human Chorionic Gonadotropin (hCG (Sigma) ) were 
administered I . P . to induce superovulat ion . 

Fertilized eggs were collected and stored in a 
37°C/5% C0 2 ingubator until microinjection. 

10-20 micrograms of plasmid DNA containing a 
cDNA of the ZCTGF4 gene was linearized, gel-purified," and 
resuspended in 10 mM Tris pH 7.4, 0.25 mM EDTA pH 8 . o\ at 
a final concentration of 5-10 nanograms per microliter for 
microinj ection . 

Plasmid DNA was microinj ected into harvested 
eggs and were penetrated with an injection needle, into 
one or both of the haploid pronuclei . 

The following day 2-cell embryos were 
transferred into pseudopregnant recipients. The recipients 
were returned to cages in pairs, and allowed 19-21 days 
gestation. After birth, 19-21 days postpartum was allowed 
before weaning. The 25 weanlings were sexed and placed 
into separate sex cages, and a 0.5 cm biopsy (used for 
genotyping) was snipped off the tail with clean scissors. 

Genomic DNA was prepared from the tail snips 
using a Qiagen Dneasy kit following the manufacturer's 
instructions. Genomic DNA was analyzed by PCR using 
primers designed to the human growth hormone (hGH) 3 1 UTR 
portion of the transgenic vector. A region unique to the 
human sequence was identified from an alignment of the 
human and mouse growth hormone 3 1 UTR DNA sequences, 
ensuring that the PCR reaction does not amplify the mouse 
sequence. Primers zcl7251 (SEQ ID NO: 31) and zcl7252 
(SEQ ID NO: 32) amplify a 368 base pair fragment of hGH. 
In addition, primers zc!7156 (SEQ ID NO: 33) and zcl7157 
(SEQ ID NO: 34) , which hybridize to vector sequences and 
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amplify the cDNA insert, were often used along with the 
hGH primers. In these experiments, DNA from animals 
positive for the transgene generated two bands, a 3 68 base 
pair band corresponding to the hGH 3 1 UTR fragment and a 
band of variable size corresponding to the cDNA insert. 

Once 9 animals were confirmed to be transgenic 
(TG) , they were back with C57B1/6 wild-type mates. As 
pups were born and weaned, the sexes were separated, and 
their tails snipped for genotyping. 

To check for expression of a transgene in a live 
animal, a small partial hepatic biopsy was collected.- The 
collected liver biopsy was transferred to a 14' ml 
polypropylene round bottom tube and snap frozen in liquid 
nitrogen and then stored on dry ice. 

Analysis of the mRNA expression level of each 
transgene was done using an RNA solution hybridization 
assay. In addition, transgenic mice were observed to be 
smaller than wild-type mice from the same litter. Thymus 
and spleen weights were observed to be lower when 
normalized by brain-to-organ weights . Histological 
examination revealed that most of the transgenic mice had 
some pancreatic atrophy and some cardiomyopathy. 

From the foregoing, it will be appreciated that, 
although specific embodiments of the invention have been 
described herein for purposes of illustration, various 
modifications may be made without deviating from the 
spirit and scope of the invention. Accordingly, the 
invention is not limited except as by the appended claims. 



